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PREFACE 


1. The present study is based on experimental research in speech- 
sound made in the phonetic laboratory of the Tokyo School of Foreiffn 
Languages, which was designed and established by the author under 
the auspices of the School. And its primary object is to investigate 
into the nature of accent. 

2. Most of the previous experiments in accent and intonation have 
been conducted without strictly scientific equipment, and even when 
the kymograph was used, pitch-curves alone were resorted to, stress 
being, to a large extent, ignored by reason of the difficulty met 
with in measuring it accurately. In view of this, an attempt has been 
made in the present study to improve the conventional system by the 
use of oscillograms of speech-sound. With the help of oscillograms, 
pitch and stress of sound have been measured and represented in 
curves with scientific accuracy. It is here that subjective judgment 
comes in with the result that the nature of accent has been thoroughly 
expounded. 

3. Firstly, a study has been made of the pitch, stress and sonority 
each vowel comes to possess when pronounced by itself. Secondly, 
classifications have been made from the standpoint of “Gestalt" of 
monosyllables and bisyllables of seven languages, i.e., Japanese, 
English, German, Chinese, Korean, Spanish and Malay. As a result, 
it has been revealed that each language is possessed of “tone-form” 
peculiar to itself and that various types and kinds of accent are re- 
cognizable in those languages which have casually been labelled 
either as pitch accent or as stress accent languages. 

4. In order to facilitate the study of intonation and melody, various 
phases of “tone-form" have been singled out and further study has 
been made on the impression each gives to human ear. 

6. A language as material of speech, contents to be expressed, 
manner of pronunciation, tone, etc. may be mentioned as factors in the 
determination of a certain concrete form of speech. From the phone- 
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tic point of view,, however, the moat important is the first factor, i.e. 
lansruage, to speak more precisely, its speech-sound, syllables, word- 
structure, kinds of accent, syntactical patterns, etc. In order to see 
how the peculiarities of these affect the general tone of speech-sound, 
an examination has been made of many sentences in nine different 
languages, viz. English, German, French, Japanese, Chinese, Korean, 
Hindustani, Russian and Mongolian, each of which is distinguished by 
its own characteristic type of accent. As a result, it has come to be 
clearly understood what word accent, collocation accent, and sentence 
accent are, what position they take and how they are utilized in the 
sphere of language. 


June 30th, 1936. 


T. CHIBA. 


ADDENDUM: 

Since in the present work, tone quality is not strictly dealt with (an 
investigation into tone quality will be made and published later) , some 
of the phonetic letters used as symbols for pronunciation are similar 
to the Roman script. 

The phonetic signs for Chinese, Korean, Hindustani, Russian, Mon- 
golian, etc. are those which are being actually employed in the depart- 
ments of these respective languages in the Tokyo School of Foreign 
Languages with the exception of Korean, (phonetic symbols used in 
the case of this language being those adopted by the Korean Phonetic 
Society, etc.). In the cases of Hindustani and Mongolian, the symbols 
employed are not so much phonetic transcription as the romanization 
of their alphabet. 
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CHAPTER 1 

. INTRODUCTION 

I. 1. Definition of Accent and Other Technical Terms. 

The word accent as a genera] term was originally derived from the 
Latin “accentus” which pertained in its first stage only to the pitch 
of speech-sound; but there has since been a gradual shift in meaning, 
and now the sense of the word varies in different languages, and has 
wider and more divergent applications than formerly. I am confident, 
however, that the definition and use of the term may be stated as fol- 
lows, when the subject is considered generally and in a brief manner: 

I. Accent is said to fall on that syllable of a word which' has either 
a stronger stress or a higher pitch than any other syllable. 

II. When there is a definite relation either between pitch or between 
stress among the syllables that form a word, that relation is called 
accent. 

The inadequacy*’ has often been dwelt upon by many of an attempt 
to distinguish the accented syllables of a word from the unaccented. 

Definition I. has usually been resorted to for the sake of convenience 
in teaching certain languages; from the standpoint of phonetics, 
however, definition II. is preferable. Some languages have more 
or less established arrangements of pitch m the syllables of the word, 
while others have those of stress. The former are languages with 
pitch accent and the latter with stress accent. For instance, in the 
Japanese word takk, so long as the ta syllable is lower in pitch than 
the Tck, it is immaterial whether ta is stronger or weaker. From this 
point of view, therefore, it may be said that Japanese is a language 
with pitch accent. Again, in the English word “knowledge,” if only 
the syllable know- is stronger than the syllable -ledge, it makes little 
difference whether the initial syllable is higher or lower in pitch than 
that which follows. From this, English can be described as a lan- 

• See Scripture. “Elements of Experimental Phonetics,” p. SOS. 

See Dr. K. Sakuma: “Pronunciation of Japanese,” p. 388. 
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guage with stress accent. 

In determining, with reference to a given language, whether it be- 
longs to the “pitch accent" group or the “stress accent" group, re- 
course should be had to the following criteria : 

(a) To ascertain whether the pitch and stress in the syllables of a 
word are not subject to change, in whatever position the word may oc- 
cupy in a snetence — it should be borne in mind that in such a case the 
terms pitch and stress will hereafter be used, not in the sense of abso- 
lute pitch and sti'ess, but to indicate the relative pitch and stress within 
the word itself. 

(b) To make sure whether a given word is liable to be deprived of its 
peculiarity when the pitch and stress are made to vary in many ways. 

(c) To find out which remains immutable, the pitch or the stress, 
free from personal equation, when one and the same word is pro- 
nounced by different individuals. The immutability, however, does not 
wholly account for the quality of accent, but only partially. 

In view of these facts, the definition of accent may now be stated ; 
accent denotes such a relation of pitch, stress and speed of sound as is 
characteristic of a given word. If the stress typifies a word, it should 
invariably remain constant, and so should the pitch for the same rea- 
son. In this way, we can distinguish the stress from the pitch accent. 

How the accentual peculiarity of each element gives it a feature, 
when several words form a sentence, or conversely what sentence (re- 
garding the construction, tone and phraseology) has such and such a 
kind of word accent, or whether or not only “syllable accent" is allow- 
ed to exist, or what it is like, if it admits of “sentence accent" besides — 
all these questions only the future can solve. In this study, however, 
some attempt has been made to suggest a possible answer to these 
questions. 

Different persons have adopted different terms for pitch, stress and 
speed of speech-sound. Indeed, several words have been used to de- 
note one obj'ect only, or vice versa. No wonder a good deal of com- 
plexity has been caused. To define the term correctly, it becomes 
necessary to understand thoroughly the object in view. In the termi- 
nology suggested below, conflicting terms will be avoided as much as 
possible for fear of confusion with those that have already been 
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adopted. I shall now give terms best suited and define such as have 
already been named. 

(a) Tone may be defined, in contradistinction to phoneme, as a com- 
prehensive term for the note of a language (including pitch, stress and 
time) : e.g. the Hindu language is of monotone. 

(b) KnehS and gocKo, though analogous in meaning, are more or less 
limited in application as compared with tone : e g. gekietsu naru ku- 
chS or violent tone ; totsu totsu taru gocho or faltering tone ; bubetm 
no nen o amwashita gocho or contemptuous tone. The corresponding 
term in English is also tone. 

(c) Style is a mode of speaking: e g. conversational style; recitative 
style; stage dialogue style, etc. 

(d) Intonation may be defined as the variations which take place in 
the pitch, stress and speed of the voice in producing speech-sound : e.g. 
one and the same sentence may have quite different shades of meaning 
with different forms of intonation; we often hear a remark such as: 
"His speech is rich in intonation.” 

(e) Stress is defined as the degree of force in utterance. The ob- 
jective volume of the attributes of the voice, is the pressure amplitude, 
frequency and energy (or intensity) ; this last being in proportion to 
the square of the product of the pressure amplitude and frequency. 
The stress, consequently, means both subjective strength of the voice 
and force of breath. 

(f) If we speak with sti'ess, the voice becomes stronger and higher. 
By a stressed syllable is meant, however, a syllable that is strong in 
stress but not necessarily high in pitch. 

(g) By means of emphasis, a certain word can be made especially 
prominent and can be given much force. Emphasis does not always 
refer to “strong utterance” alone. The emphatic effect may be pro- 
duced by the use of a sentence well suited to the purpose. It may 
sometimes happen that an ordinary sentence is emphasized by reason 
of due expression. 

(h) Emphasis is often used to attach weight to some words, phrases 
or sentences according to the sense or purport of the speech or accor- 
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ding as the usage of the language dictates. This, however, has nothing 
directly to do with pronunciation. 

(i) Exaggeration is defined as a method by means of which, the 
pitch, stress and speed are made to vary more than regularly. Wheth- 
er or not this succeeds in giving any marked effect psychologically is 
quite another matter. 

(j) When speaking in this book of the tone as rising, falling or level, 
this does not refer to the stress but to the pitch alone. 

(k) The definition having already been given, the term accent will 
do also in Japanese. Gosei is, however, the corresponding Japanese 
term. 

It seems that the words “accented sound” in a language of the 
“stress accent” group, refer to such sounds as give the hearer a keener 
impression psychologically. The term “akzentuieren”, on the other 
hand, is intended for such sounds as give greater acoustic effect — gen- 
erally, stronger, higher and longer — physically. As a musical term, 
accent means louder sound of the intensity rhythm. But when it comes 
to the accent of words, accent may be said to occur where A syllable 
is either stronger or weaker than B, it being, so to speak, a mere cove- 
nant in the sphere of pronunciation. Moreover, in some cases, we have 
the level accent when two syllables are equal in stress. Thus, “ac- 
cented syllable” comes to lose much of its raison d'Hre. It can safely 
be said, therefore, that what has been called by the term “accented 
syllable”* should be designated with terms such as “syllable pronounc- 
ed with stress”, or “the strongest syllable”, or “the highest syllable”, 
or “syllable that gives the greatest acoustic effect,” instead of the de- 
finition that has previously been accepted. As is usual in colloquial 
speech, in which words are often given double meanings, the word “ac- 
cent” is often used as synonymous with “stress”. It may be advisable 
to use the word in this application distinctly from yet another sense 
of the same word such as in “accented syllable”. Happily, German 
has such a fit term as “betontei Silbe”, whereas neither Japanese nor 
English has this advantage. Although Dr. Sakuma* adopts in his 

* See Dr. K. Sakuma: “Pronunciation of Japanese,” p. 382, 
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works some such phrases as “syllable at the peak of accent”, and “syl- 
lable at the top of accent”, these cannot be said to be invariably ap- 
propriate. In the present thesis, therefore, I shall use a suitable word 
according to circumstances. 

I. 2. Equipment for Experiment. 

Before treating the main subject, a brief survey wili be given on the 
experimental equipment that has furnished the data. 

The experimental work consists of two processes : i. e. the photo- 
graphing of the sound wave with the aid of a condenser microphone, 
ampiifier and oscillograph* ; and the measurement of the frequency, 
amplitude and time of the sound as recorded on the film. 

It goes without saying that this method is by far superior to that in 
which the kymograph or other instruments are used for the registra- 
tion of the sound wave. The sensitiveness of the microphone, how- 
ever, cannot be constant with varieties of frequency of sound. In case 
a frequency that the microphone can feel most sensitively approxi- 
mates to that of the formant of a certain vowel, the amplitude of that 
vowel is magnified more than in any other vowels. It is a pity that in 
this way absolute accuracy is not to be expected in the course of the 
experiment. 

To measure exactly the relative amplitude of a vowel, first of all, a 
complete analysis of the vowel is necessary. The result will be ad- 
justed then through the sensitiveness i. e. amplitude frequency of char- 
acteristic of the microphone, and after that it will be made into wave 
form by a synthetic process. This done, the measurement of ampli- 
tude is possible. To repeat the exact process with each individual 
vowel, however, is very complex and is (moreover) hardly practicable. 

Besides, too much strictness is not required for the present purpose. 
The records on the film are accordingly used without any modification. 

However, the pitch of sound can be measured precisely. The subjec- 
tive pitch of sound corresponds with the objective frequency, the form- 

* See Dr. G. Obata: "Experimental Acoustics,” p. 117. 
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er being in proportion to the logarithm of frequency, within the scope 
of speech-sound. The pitch of speech-sound, however, generally 
varies within the range of one octave, and within this scope the loga- 
rithmic scale does not differ much from the ordinary one. In addi- 
tion, with speech-sound, the relation between the actual change in fre- 
quency and the psychological interval is no simple logarithmic one. ‘On 
the contrary, there is a vast difference between the two in the rapid- 
ity of change in pitch and stress. Consequently, it becomes necessary 
that for each pitch-curve a different form of gradation should be 
given if the change in the psychological pitch is to be represented most 
minutely by means of the pitch-curve. The physical scale is preferred 
for this reason, while at the same time considering the economy of 
time and labour necessary for the complex enumeration. Thus, the 
pitch and stress will be made to correspond with the frequency and 
amplitude respectively. In the graphic diagram of the pitch-stress- 
time curve that will be given in the following sections, the abscissae 
and the ordinates stand respectively for time (unit: 1/50 sec.) on the 
one hand and pitch (above) and stress (below) on the other. A syl- 
lable has been chosen as unit, instead of each separate sound, in the 
interpretation of the curve, for the former system has the advantage of 
being legible. Vowels are greatest in amplitude (stress), next come 
nasals followed in turn by liquids, voiced consonants, J and t/ in the 
given order. The other voiceless consonants are very small in ampli- 
tude or are held for so short a period of time as not to be perceptible 
in the diagram. 


I. 3. Pitch, Stress and Sonority of Vowels. 

Previous to the classification of accent, it may be considered to what 
extent human ears can follow the different pitch and stress of each 
vowel. 

With whatever equal pitch and stress, a, i, m, e, o, may be produced 
the investigation of the film record of the resultant sound waves re- 
veals a considerable difference in amplitude between the vowels. 

The table given on the next page shows the amplitudes (the upper 
rows) and frequencies (the lower rows) measured from the oscillo- 
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grams of the five Japanese vowels uttered by eight different persons. 
The column on the extreme right is for the figure representing the 
mean amplitude of the five vowels as produced by each individual, 
obtainable when the average amplitude of the five vowels is fixed at I. 
Fof instance, in the case of the speaker M, the amplitude of i, is 6.0 
m.m., the pitch 198, the average amplitude of the five vowels being 
12.0 m.m. Supposing that this last is fixed at I, the figure for the 
amplitude of i, stands at 6 m m./12 m m.=0.5. 

Even when a, i, «, a, o, are pronounced successively, the pitch of 
each varies with individuals. Generally speaking, however, the pitch 
is lowest with a and gradually rises with i and u reaching its highest 
with u and e, after that, it is again on the decline with o. Variations 
in pitch are more marked in the case of the female voice. As for 
amplitude, it is smaller with 4, u, and larger with o, e, a. 


Pitch and Stress of Japanese Vowels 


Froaanneer 

a 


u 


0 


M 


1.37 

6.0 

Xg8 

O.S 

“o!7 

SIX 

0.89 

Ha 

'1.13 


1 13 

tnta 

13.0 

T 

«S 

150 

1.00 


oAa 

m 


187 

i.p8 

Z0.0 

Z87 

*-54 

6-5 

N 


1.0S 

4 -S 

136 


5.0 

*39 

0,88 

50 

136 

X.BS 

8.0 

127 

X 40 

5-7 

y 

6 0 
*35 


3 -a 

x6e 

B 

7-0 

163 

IS9 

7*3 

z68 

X 24 . 

6 0 
*55 

X.OS 

59 

A 

«3 0 
a *4 

0 93 

X 3.0 

338 

0.85 

* 4-5 

*34 

1.03 

14 0 
334 

0.99 

17.0 

333 

1.21 

14.X 

K 

*3 5 
*37 

1.15 


oA 

95 

*49 

oAi 

11.5 

156 


B 

*-45 

II 7 

F 


r,i7 

30 

179 


6 0 
303 

0.78 

85 

*78 

X.Zo 

■SB 

» 56 

7-7 

H 

7-7 

J73 

o.8x 

9-0 

19a 

■ 

K 3.0 

193 

X.36 

10.0 
x 83 , 

. *.t >5 

g.o 

X90 

095 

9-5 




■I 

B 

■■ 

B 

■i 

I2a 


1.36 



i6o X7S >74 i8i 178 


The average pitch of the eight different voices is 160 for a, 175 for t, 
174 for M, 181 for e, and lastly 178 for o. The average amplitude, on 
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the other hand, is 1.06, 0.65, 0.95, 1.06 and 1.36 respectively. 

As is clear from the foregoing, in case the vowels are separated from 
one another for an appreciable amount of time in utterance, a differ- 
ence in pitch, if any, may not distinctly be observed. The pitch varies 
to a marked extent in the production of one single vowel. As ‘for 
stress, it varies considerably with different vowels, not only in the case 
of one individual but also when represented on an average basis with 
regard to many. To find out the cause, a method may be suggested of 
changing the degree of opening of the lip-passage in utterance of the 
sound. In normal speech, a is greatest in amplitude, next comes e, 
followed in turn by o, i, u, in the given order. But if the five vowels are 
all formed with the lip position feri, (it is of course impossible, I know, 
to produce each exactly in this way; but the peculiarity of each can 
still be retained with comparative ease; thus, i may be so pronounced 
as not to be confused with c,) they are almost level (with one another) 
in amplitude. Similarly, when pronounced with the regular lip-position 
for e, they are very much like e, m amplitude. Therefore, it may be 
said that the stress very largely depends upon the degree of opening of 
the lip-passage. Of the eight speakers, the one who pronounced all the 
vowels with similar amplitude is the one with whom little variation was 
perceived in the lip-position. Conversely, the variation in amplitude 
points to such an individual as pronounced with more labial movements. 

Thus it seems that there is no definite relation between amplitude 
and pitch. 

That the amplitude is smaller with i, u, and that this smallness in 
stress IS not so distinctly observed as in others is, in my opinion, not due 
to the question of the sonority of these vowels. (Although Prof. Jmbo 
defines sonority as relative stress of each unitary sound as pronounced 
with the same degree of force of utterance, here I shall regard it as 
basic quality to determine which of two unitary sounds with equal am- 
plitude and frequency is more sonorous irrespective of the force of 
breath of the speaker.*) It is a well-known fact that the sonority be- 
comes altered in the pure tone if the pitch varies, even though the am- 
plitude remains constant. For instance, a sound with a frequency of 

* K. Jlnbo' “Japanese Phonetics,” p. 96. 
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2,000 is much more sonorous than another with a frequency of 50 that 
is equal in amplitude. 

The question now arises whether a sound with a relatively small 
amplitude can produce by reason of its sonority the same psycho- 
acoustic effect as another with a greater amplitude. As will be illus- 
trated in the following examples, if we receive stronger acoustic im- 
pressions from them than from others, even i and u arc larger and 
higher in amplitude and pitch. After all, it seems that human ears are 
not dependable in comparing two vowels, at least with quantitative 
stress as a sole criterion. Thus, we have now to solve the question how 
the subjective stress is measured. It may be said, I suppose, that any 
two sounds that are level in both amplitude and frequency are usually 
so in sonority too. In consequence, the stress of a sound will hereafter 
be represented by its amplitude. 

It is possible to apeak in such a way as to make the stress and pitch 
vary independently of each other: e.g. we can sing a song with the 
same stress as in speech or alter the stress while the pitch remains quite 
unchanged. Generally, as has been explained, the voice becomes 
higher as well as stronger, when we speak with stress. 

Thus, in the case of normal speech-sound with comparatively less 
intonation or a word uttered by itself, a syllable at the “peak of 
accent” is usually higher and stronger than any other. As far as one 
single syllable is concerned, it is often observed that the stress and pitch 
run inversely: i.e., the higher the pitch the weaker the stress becomes 
or vice versa. 

Without any further remark on this phenomenon, an attempt will be 
made at an objective classification of accent with regard to pitch and 
stress. The data used in this investigation are from eight languages, 
i.e., Japanese, English, German, Spanish, Chinese, Korean, Hindustani 
and Malay. The speakers are those who speak their respective 
mother-tongue in standard type of pronunciation. 
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CHAPTER II 

TYPES OP ACCENT AND SOUND PSYCHOLOGY 

U. 1, Classification of Types of Accent in Respect of 
External Forms, 

In the classification of the types of accent, words of two syllables 
have mostly been treated, while at the same time reference has been 
made, when necessary, to polysyllables. 

(A) The stress and pitch vary proportionally throughout a word. 

(B) The stress and pitch of the two syllables vary inversely. 

(C) The stress remains almost unchanged, but the pitch varies. 

(D) The pitch scarcely changes, but the stress varies. 

(E) Both pitch and stress change, but only very slightly. 

(F) The stress remains approximately constant, but the pitch rises 
in the first and falls in the second syllables. 

(G.H) The stress hardly varies, and the pitch-curve takes a mount- 
ain shape in the initial syllable, and is on the decline in the second. 

(I) The stress scarcely alters, and the pitch-curve makes a triangle. 

(J) The pitch ascends, and the stress changes in different ways. 

In the following, each will be studied in detail. 

(Ai) The stress and pitch vary proportionally all through the word 
and each syllable makes a mountain shape (when represented in 
curve) . 

In Fig. 1. are shown the 
pitch and stress curves of Erhe. 

Erde, Erhe, Ehre, horen, leu 
der, and many other German 
words are of this type; bony 
shows an example in English. 

Many Mongolian words be- 
long to this type, in which, 
however, the last syllable is 
the highest and strongest. 



Fig. 1. Erbe 
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This occurs mostly when a word consists of vowels and voiced con- 
sonants alone. Those words which contain voiceless consonants do not 
fall under this class. 


(AJ (Because of the voiceless consonant, the pitch-curve is broken 
and) the correspondence between the two curves is destroyed to a 
certain degree, but not at all fatally (Fig. 2). German Vater, loschen 
English pity, merry, gather, bushy, china, toilet, etc. are all grouped 
under this class. 


In the case of (Ai) type, there is a 
close correspondence between pitch 
and stress, and so a syllable can be 
represented by the highest points of 
both pitch and stress. As the two 
points do not regularly coincide in 
(Aj), a syllable cannot be represented 
in this way. Words of this type can 
be found, however, in large numbers 
in those languages which have “stress accent.” The stress-curve at 
the height, therefore, is made to represent the syllable : in Fig. 2 



Fig. 2. toilet 


(toilet) Fs,?^: Qs, Q's are the points representing the syllable. Fig. 
3 & 4 are represented by co-ordinating pitch and stress. The abscissae 
and ordinates stand for pitch and stress respectively. This is done 
after having fixed the points representing the two syllables of the 


words in ( AJ and (A 2 ) . 


The result may be somewhat influenced by personal peculiarity, as 
the data are from the speech of an Englishman and a German (respec- 
tively). Though it varies with different words even in the case of 
speech by one and the same person, still on an average, the interval 
between the two syllables is, in German, a fifth and the ratio of stress 
(amplitude) 1.4; and in English, a fifth and 2.6 respectively. 
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This type is, however, very rare in Japanese even when a word is 
uttered by itself. Moreover, the highest points of the pitch and stress 
never coincide, so that it is difficult to fix the points representative of 
the syllable. The separate studies of pilch and stress reveal that the 
interval between a “syllable at the peak of accent” and another rarges 
mostly from a second to a fourth and amounts in rare cases to a fifth, 
an interval much smaller compared with that in English or German 
which attains a fifth on an average. But in recitation or stage dialogue, 
the interval is sometimes extremely exaggerated with the result that it 
often reaches a major sixth. The ratio of stress for the dissyllable is 
given below in tabular form. It varies considerably with different 
words and is not so regular as in English or German. 


hoto (pigeon) 1.0 togon (gourd-melon) l.S 

«ajfee(wine) 1.6 oifei (autumn) 1.1 

6a6a(old woman) 1.4 tro (colour) 2.0 

a^(space) 1.0 sa/^e (salmon) 1.5 

Aofa(flag) 1.6 mochi{rice dumpling) 1.2 

(At) There is no close relation between the curves of pitch and 



Pig. 5. mochi 


stress (the curves of pitch 
and stress do not run 
parallel to each other), 
but at least the order of 
magnitudes of the two 
curves coincide (Fig. 5). 
The dissyllables and poly- 
syllables given below be- 
long to this class: 


mochi, akari, laboratorial, organ, feverish, pitiless, goldfever, baker, 
clearly, birthday, houses, etc. 

(A4) With the dissyllable or polysyllable, the pitch-curve is of 
mountain shape reaching its highest in a certain syllable and the 
stress-curve goes up a little at some point in each syllable. (In this case, 
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may 

sometimes be broken ow- 
ing to an intervening 
voiceless sound. By link- 
ing it, however, it will 
make a complete running 

T,. « , curve). 

Pig. 6. obasan 

In addition to obasan (Fig. 6.1, Japanese eipo (cinema), mori (har- 
poon), baba (grandmother), kbrogi (cricket), soramame (beans), 
English murmur, murmurously , etc. are of this class. 

(As) This somewhat resembles the type (Ai) in having the pitch 
climax in the initial syllable. The initial syllable is, however, a little 

stronger in stress than the final one, 
and the very steep pitch-curve char- 
acterises this type. The rising portion 
of the pitch-curve is almost coexten- 
sive with the first syllable towards the 
end of which it reaches its height. 
Fig. 7. shows an example. This type 
is frequently found in Spanish: e.g. 
punto, vtrgen, silla, cerro, bajo, hoda, etc. 
The interval of the rising pitch is a 
minor fifth on an average when the stress and pitch are on the increase 
concurrently, and is a minor third when the pitch ascends suddenly 
after the stress has considerably increased. In both cases, the average 
interval of the falling pitch is a seventh. 



(B) In the dissyllable, the order of magnitudes of the pitch and 
stress of the two syllables vary inversely. 

As is shown in Fig. 8. the “syllable at the peak of accent” has smal- 
ler stress, though higher in pitch. Japanese niisan (brother), German 
&ber, etc. are of this type. 
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Fig. 8. fiber > Fig. 9 . Beute 

The diphthongal Beute tboyta] baker [beikaj, etc. cannot strictly 


be described as dissyllabic, [y] and [i] are, however, weaker but 
higher than [o] or [e] ; see Fig. 9. Close vowels such as i, y, ii, etc. 


are, judging from this, liable to be pronounced with a higher pitch. 



(C) The stress hardly 
varies, but the pitch does. 
Japanese aid (space) 
(Fig. 10.) is an excellent 
example; so also Japa- 
nese tori (bird), English 
pity, etc. 


Fig. 10. aki 



0 10 20 


Fig. 11. morau 

(E) A certain syllable produces 


(D) The pitch stays al- 
most constant, and the 
stress either varies or re- 
mains immutable. Japa>. 
nese morau, togan, kanemo- 
chi, taw, etc. are of this 
type (Fig. 11.). 

stronger psycho-acoustic effect 


and yet differs but slightly in pitch and stress from others. This, how- 
ever, is not of the same type as what Dr. Sakuma calls "flat-accent”. 
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Only the Japanese language has words of this type: e.g. ai (love),«- 
ni (sea-urchin), ine (rice-plant), somtname (beans), aozora (blue 
sky), etc. In Fig. 12. uni is illustrated, and in this the sudden eleva- 
tion in pitch is particularly noticeable. 

(F) As in the (A/) type, the pitch- 
curve is of mountain shape, only much 
steeper. Unlike that type, however, 
the height is on neither syllable; 
though the pitch rises in the initial 
and falls in the last syllable, the pitch 
climax is somewhere intermediate 
between the two. In addition, it more 
or less resembles the (E) type in that 
it gives an impression of "stop” on 
account of the sudden rise in pitch. 

The stress also gains or diminishes 
according as the pitch rises or falls, 
and the stress-curve assumes a trian- 
gular form with a vertex much inclin- 
ing toward one direction. There is 
little difference in stress between the 
two syllables. ' 

Malay furnishes a good many ex- 
amples of this type : e.g. tiram, belot, 
asap, orang, tmang, ulat, etc. The in- 
terval of the rising pitch is about a fourth when the rising occurs initi- 
ally, and a second when the pitch goes up only/ after the stress has 
sufficiently increased. That of the falling pitch is a sixth on an 
average. 

(G) Sudden rise and fall in pitch characterizes the Malay lan- 
guage; but when a word the second syllable of which begins with a 
voiceless consonant is pronounced in the same manner as in (F), the 
result will be as that illustrated in Fig. 14. Malay ikan, makan. 
esok, etc. show examples of this type. The pitch-curve for the Japan- 
ese long consonant (sokuon) (e.g. kekko ) is similar to that of 

this class. 
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In the case of the former, however, 
the time-interval between the two 
syllables ranges from 7/ 50 to 10/50 
sec. — greater than that for the (G) 
type which varies between 2/60 and 
3/50 sec. The Japanese long con- 
sonant in ikkan, in quicker pronun- 
ciation, becomes akin to Malay than. 


(H) The two syllables are almost level in stress, and the pitch- 
curve forms a mountain shape in the first syllable, while the second 
shows a falling tendency. See Fig. 15. 

This may be sub-divided into: 



(i) Both initial and 
final syllables are equal 
in pitch at the height. 
See Fig. 15. A. 

(ii) The) initial syllable 
is higher (Fig. 16. B.). 

(iii) The final syllable 
is higher (Fig. 15. C.). 
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These are all Korean, of which t/- a [hag, g]o], ^ Bll[[o‘i‘^] ^ [thal 

tjvim] belong to (i ),'!!;»[ [an ’4ga] to (ii), and lastly, ^1- S Hgau ’gun] to 
(iii); [hag, gjo] and [o’ba], however, seem to resemble the Malay words of 
the (G) type. But in the Korean words, it takes much longer to shift 
from one syllable to another, for they are long consonants. Korean 
words that are so built that the final vowel or consonant of the first 
syllable which is short is liable to link with the initial consonant of the' 
second, are very like Malay words. As compared with the latter, how- 
ever, they are apt to be pronounced in a more protracted manner so 
that the rise and fall in pitch is less rapid. 

(I) The initial syllable is of rising and the second of falling tone. 
The pitch-curve is similar to that for the (F) type, but the fall is 
much more gradual. When a word that ends in and begins with a 
voiced consonant in the first and second syllables respectively is pro- 
duced in the same manner as in the (H) type, its accent becomes of this 
type (Fig. 16.) ; e.g. Korean Ith [am dja], ^ y [goo njil] ^ H) Ptallij ^ 
l| Bj [nil ni Ii], etc. 

Successions of syllables such as ^ ^ .a [phurrm phurrm] that are 

formed by repeating one and the same short dissyllable, are similar in 
pitch to the dissyllable standing by itself. 



Fig. 16. ^0 njil Fig. 17. phurrm phurrm 


Japanese ao-ao is no mere repetition of fio but [od do] ; similarly in 
Korean too, there seems to be the transfer of accent in phrases. 

(J) The Hindustani word, when pronounced by itself, varies but little 
in pitch and stress. A gradual rise in pitch is, however, its peculiarity. 
The stress is greater sometimes in the initial and sometimes in the final 
syllables, there being no regularity. 
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(i) In dissyllables, as is shown in Fig. 18. [khainai] , the pitch 
continues to rise until the stress reaches its climax in the final syllable, 
and then begins to fall as the stress decreases. Besides [khaina:j,[gaja:], 
[d3a:ie:].rth3:ra:],[za:limj,tzala],etc. are good examples. The symbol ['] 
is attached to a comparatively stronger syllable, the absence of which 


indicates that the two are all levd it 




itress (similarly stressed). 

Because of a voiceless 
consonant with which the 
last syllable begins, the 
pitch-curve may some- 
times be broken. By link- 
ing it up, however, we 
have such a continuous 
curve as is shown in Fig. 
18. 

Examples: fletkinj, [mas- 
lam], [.hitam]. 

(ii) The final syllable 
rises in gradation. Of the 
Hindustani words used as 
material for this experi- 
ment, [pandit] ( Fig. 19 ). 
and [suddt], each of which 
begins with d in the last 
syllable, belong to this 
class. 



(iii) Dissyllables the 
second syllable of which 
is longer than the first 
or trisyllables when 
they are divided into 
two breath-groups of 
which the second ele- 
ment is longer, are 
something llke[dDq)aher] 
in Fig. 20-1. 
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It may be noted that 
with this type the 
pitch tends to rise gra- 
dually. Beside [doipa- 
h^r],rtakli:fj,[mari:z],[za- 
ba:n], [pazmurdah], etc. 
all furnish good ex- 
amples. In the case 
of [madarsei] (Fig. 20- 
2), however, the ini- 
tial breath-group is 
longer. 

II. 2. Variations in Pitch and Stress within a Single Syllable 
and Sound Psychology. 

Elxperimental studies of speech-sound that have hitherto been made 
have a weakness in that stress, one of the most important sound ele- 
ments, has been neglected in the discussion of accent and intonation. 
The measurement of stress being a matter of difficulty, the pitch- 
curve alone has been resorted to in such cases. 

It is necessary, however, in the study of speech-sound to know not only 
the pitch but also the change m stress. The absence of knowledge in one 
or the other is indicative of an incomplete investigation of sound. Even 
when the pitch-curve is similar, the acoustic impressions produced differ 
much with the variant degrees of stress. In the following will be studied, 
through actual examples, varieties of combinations of the varying pitch 
and stress and acoustic impressions they give. 

(i) The pitch falls gently simultaneously with the decrease in stress. 

German loschen, uber, 
fallen, asyl, English ba- 
ker, hashy, laboratonol, 
etc. are of this type. 

In Japanese sushi [suji] 
sake [sake] mori [mori] 
niisatt [niisan] hato [ha- 
toj obasaw [obaisan], etc.. 
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however, the fall in pitch and 
the decrease in stress are 
noticed a little more promi- 
nently. The sound with this 
type of pitch and stress gives 
psychologically a feeling of 
inertia. 


This is also the case with the final syllable in Korean and Malay. 
In these languages, however, the falling interval is greater and rapid 
so that the sensation produced is rather stronger. 


(ii) The pitch falls slowly, while 



the stress increases by degrees. 
This type is very common 
among stressed syllables in 
English words of two syllables 
such as foolish, sorrow, ftoMses, 
toilet, clearly, fairly, tourist, 
before, etc. A sense of insta- 
bility is often felt when one 
hears a sound of this type. 


In another type, the stress decreases at the same time that the pitch 
ascends. Such is rare in a word pronounced by itself, but is rather 
common in connected speech both in English and French. English 
birthdo]/ uttered by itself provides an example. In this case, though 
the stress diminishes, the pitch, on the contrary, rises with the result 
that the acoustic effect produced is of greater sonority. The end of an 
interrogative sentence mostly takes this form. 


(iii) The pitch rises suddenly after the stress has considerably in- 
creased: e.g. Japanese iro [iic!],tne [inej.ai [ii], uni [uni], suzu [suzii] 
ehizu [tjizuj, smji [siidji], English cap, German ob, etc. A sound of this 
type gives an impression of “stop” — a feature in the “accented syl- 
lable” of the majority of dissyllables. 
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Fig. 23. 


A similar feeling is given when 
the pitch rises from the begin- 
ning at the same time as the in- 
crease in stress : e.g. English 
head, book, hook, box, etc. The 
initial syllable in Spanish, Malay, 
etc. is analogous to this, only 
with a greater musical interval 
peculiar to them. 


(iv) The pitch rises or falls according as the stress increases or de- 
creases; and the curves are of mountain shape: e.g. English knowledge. 


watch, gold, j/acht, German Erde, Bodea, 
impression of “force”, and is one of the 
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Fig. 24. 


foschen,etc. This type gives an 
formative elements of the (Ai) 
type accent mentioned in 
the previous section. In 
this type, however, not- 
withstanding that the 
range of the rising pitch 
amounts almost to a fifth 
and that of the falling 
pitch to a sixth, the pitch is 
not so prominent. Thus, 
~ the type is in striking con- 
trast to the type (ii) shown 
m Fig. 22. 


(v) The pitch-curves for long vowels and some diphthongs are ap- 
parently similar in shape to those for short vowels, but taking longer 
time to pronounce than the latter, the former produce diiferent auditive 
impressions. To sum up; 

(a) English day, moor, chair, sorrow, star, chalk, etc. when pro- 
nounced by themselves, exhibit curves of such form as illustrated in 
Pig. 25. A (upper) given on the next page. 

(b) English moon, noon, etc. belong to B (middle). 

(c) As is shown in C (below), the pitch-curve rises gradually, while 
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the stress either remains unchanged or slightly varies. Examples: 
English free, moon, eye, bag; German Voter, etc. 


Special attention is called to the fact that in the case of long vowels, a 
personal equation is apt to come in to a greater or lesser degree so that 
one and the same word may sometimes be of B and sometimes of C type. 

Reference will not be made to the question of sound psychology in poly- 
syllables. It suffices to say that the same applies here as in the case of 
monosyllables. 

It will, however, be understood in outline when one compares the 
pitch and stress curves for various sounds mentioned in the next 
chapter with the ways of pronouncing those sounds. But all this will 
be explained later. 

Lastly, a few words will be added as to how the variant degrees of pitch 
in speech-sound are represented. Generally the pitch is represented by 
means of musical notation on the stave. In this book, however, it is some- 
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times represented by means of intervals. And it should be noted that this 
is merely on the ground of convenience and differs a little from what is 
usually done in music. 

Speaking generally, the pitch goes on by steps in music but continu- 
ously in speech-sound. While in the former the stress of a note either is 
immutable or varies only slowly, it changes quite abruptly in the latter. 
Even when the objective inten’als of the pitch (between the rise and 
fall of the pitch) are the same, the subjective ones vary greatly ac- 
cording as the pitch is “gradational” or “continuous”, or with the dif- 
ferent degrees of rapidity in the change of pitch ; or according as the 
pitch varies with the stress proportionately or inversely. For example, 
with German Erde and Malay helot the pitch is not so salient, notwith- 
standing the fact that the interval ranges from a fifth to a seventh. 
Where the stress and pitch run inversely to each other as in Japanese 
66, even the rise of nearly an interval of a second or so in pitch is still 
distinctly audible. There is a considerable difference m the acoustic 
effect between a word in which the pitch stays constant while the stress 
varies along an undulating line and another in which the stress is con- 
stant while the pitch changes forming a curve of mountain shape (pro- 
vided that in this case, the shift from rise to fall is made sufficiently 
quickly). It is with the latter, however, that the stress element is 
more dominant. 


II. 3. Accent of Monosyllables. 

In languages of the so-called “stress accent” type, we cannot distinguish 
forms of accent in monosyllables. For accent can be recognized only by 
comparing one syllable with another concerning their relative strength: 
e.g. we have to consider whether one is stronger or weaker than the other 
or whether the two are equal in strength. In case a word is pro- 
nounced by itself, it is difficult to tell whether the stress is in 
diminuendo or crescendo without any reference to pitch. With 
languages of pitch accent, on the other hand, whether the pitch is 
of rising, level or falling tone is easily distinguished even when 
a word is pronounced by itself not only in the case of the compari- 
son of one syllable with another regarding pitch. This accounts 
for the possibility of accent on monosyllables of the “pitch accent” Ian- 
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guages. What Dr. Sakuma calls “quasi-dissyllable”* in Japanese has 
undoubtedly this kind of accent. This quasi-dissyllable is often pro- 
nounced dissyllabically. In Japanese /lei(soldier), lio (cannon), ^oi (emo- 
tion), kyu (nine), lo (cell), etc., there is a gradual fall in pitch, while in 
hei( enclosure), ho (law), jo(lock), kyu (urgent) , 65(stick), etc., the pitch 
is either level or rises slightly. Though of course the difference between 
the two groups can readily be perceived, they differ but little in the ris- 
ing and falling intervals and their degrees of inclination. In Chinese 
where the principle of "a syllable a word" is at work, recourse is had 
to the four types of accent i.e.“s Jaq”, which are different in pitch, in 
order that homophones (which are homonyms at the same time) may 
be distinguished. Any word of the language has invariably any of the 
four types of accent. They are Jar) phq, ^ia p'l®, Joq Jap, and t'fy Jap. 
To denote each of these four types, a symbol [°] is added to one of the 
four comers of the written character for the word : e.g. Jap p'ift (lower 
left-hand ; 5 ia p'lp, (upper left-hand) , Jap Jap (upper right-hand) 

Jap, (lower right-hand). The figures 1, 2, 3, 4 can be substituted for 
[■’] . All this is exemplified : M [fa'], [fa®], [fa'], [fa']. 

The pitch and stress of the four sounds are represented in curves in Pig. 
26. given on the next page : 

Jap p'ip is of quite level tone, the pitch and stress of which stay immut- 
able. gia p'ip is a short and precipitous sound like that uttered in ques- 
tions, its peculiarity being a sudden rise in pitch. With Jap Jap, the pitch 
ascends gently at first and rises sharply finally, while the stress is slightly 
on the increase both initially and finally. On the other hand, t'cy Jap 
falls quite abruptly in pitch and ends almost inaudibly, taking a shorter 
time in pronunciation. Besides the foregoing, words used in connec- 
tion with the study of "s Jap” are : JtPci'] iS:[t 9 i*J i^[t 9 i‘] Be[t?i']. But 
the pitch-curve is almost identical with that for [fa], and is very regular. 
A little difference is noted, however, in the stress-curve. 

Chinese is essentially a language with the principle of “a syllable a 
word as already described, but sometimes dissyllables or tri- or 
quadri-syllables are formed from monosyllables. In such a case, the 
accent of each element remains the same as before. Accordingly, if a 

* See Dr K Sakuma' “Pronunciation of Japanese," pp 389-395 
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monosyllable has four different types of accent, sixteen different com- 
binations are possible with a dissyllable consisting of two such mono- 
syllables. 

Examples : 

?64[hua’ Jog’] ^®[hua‘ mei’] ^^-Lhua' t'sau’] :?Eil[hua’ mq*] 

^ S5[nwu’ uo’] ^^[mau’ t'cien’] ^:i[mau’ ny^ pu‘] 

^KWao’parf] ftHiftJao’ SUI’J ^iRWoo’ ff] ^:^[tJoo’ ta‘] 
fajo'] ISMti' luoT pan’j iUsSSLti' tuu*] 

Forms of accent can also be distinguished when a Japanese monosyllable 
consisting of a short vowel is followed by /osM(auxiliary particle), though 
not if pronounced by itself. For instance, e, if pronounced sepaxately, 
may stand either for e (picture), or e (handle), but So stands no doubt for 
the former not the latter, e (handle), hi (mind), ha (leaf), no (name) 
are lower in pitch than the subsequent particle, while e (picture) , hi (tree) , 
ha (toott), MO (vegetable) are higher. 








29 


CHAPTER HI 

TONAL PECULIARITIES OF VARIOUS LANGUAGES 

in. 1. Kinds of Accent and Languages. 

With regard to the languages used as data in the preceding chapter, the 
types of accent or pitch and stress may be divided into the four groups 
given below : 

(i) The so-called “stress acccent” that denotes the constancy of the 
relative stress within a word — e g. English, German ; 

(ii) The so-called “pitch accent” that denotes the immutability of 
the relative pitch in a word — e.g. Japanese, Chinese, Korean, Mon- 
golian ; 

(iii) All the words of a given language belonging to this group have 
a pitch-curve of the same type — e.g. Malay, Hindustani; 

(iv) All the words of a given language, in which not only the pitch 
but also the stress are more or less of the same type— e.g. Spanish. 

Of these, (lii) and (iv) have resulted from the classification on the 
basis of registered sound waves. It is, therefore, not evident whether 
the words have invariably the same type of accent whatever their posi- 
tions may be in a sentence. This is true at least from the data collected 
so far. Since, however, these words exhibit, without a single excep- 
tion, a striking feature that distinguishes them from those of other 
languages, it follows that the accent of their languages differs much in 
nature from that of other languages. 

Accordingly, we learn there is yet another type of accent somewhat 
different from the so-called pitch or stress accent. It may also be noted 
that there are varieties even in the pitch or sti'ess accent itself. 

When the first syllable has a rising tone whereas the second has a 
falling pitch, as in Malay and Korean, the two will be felt equally 
strong provided that the pitch intervals (rising & falling intervals) and 
the tempo are great and equal. In pronouncing these words, often we 
cannot decide readily which of the two syllables is really stressed 
(stronger). That rise and fall in pitch effect psychological impres- 
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sions which, though equal in quantity, are yet different in quality is too 
self-evident to be repeated. 

III. 2. Elements in the Formation of the Word. 

Prior to the discussion on the substance of the subject, it is necessary 
to consider how the speech material, as one of the elements in the for- 
mation of speech, and the rest of the elements vary with different lan- 
guages. 

In the preceding chapter, it has been explained that there are many 
types of accent in words as speech material. The question will be treated 
in this chapter as to what tonal arrangements we shall have when a con- 
nected speech is formed by words. 

The following five can be named as elements in the formation of con- 
nected speech : 

(i) Words as speech material; 

(ii) Usage prescribing the relative importance of words (i.e. stress) , 
syntactical pattern, rhythm, etc. best suited to the circumstances ; 

(iii) Contents to be expressed; 

(iv) Style; 

(v) Tone (speech tone). 

These are components of any connected speech. 

From the nature of -this experiment, a somewhat minute study will be 
made on (i) later. The second element is, like the first, yet another 
element of speech material. By “contents” in the third item is meant 
the thought or emotion that the speaker wishes to convey or express. 
The fourth element “style” means a mode of expression ; it may be de- 
scriptive, declamatory, theatrical, recitative or conversational. The 
difference in styles can be said to depend very largely on the degrees of 
exaggeration and intonation. By means of the fifth element, speech 
is given various phases of mood, very delicately formed : e.g. a heavy 
tone, a frivolous tone, a cheery tone, a gloomy tone, a calm tone, a vio- 
lent tone, etc. Though it seem^ that the time element plays an im- 
portant part in the change of tone, the study of this subject will be 
reserved for the future. 

Now I shall give in tabular form those phases of speech material which 
are important in some respects or other in determining the tonal pecu- 
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liarity of a language. 

(i) Kinds of speech-sound: 

(a) vowels and consonants; 

(b) languages of long vowels; those of short vowels; those in 
which both kinds of vowels are of equal importance; 

(c) consonants which are pronounced by themselves; consonants 
which are invariably followed by a vowel. 

(ii) Construction of Syllables : 

(a) languages whose words are all monosyllables; those whose 
words are largely dissyllables; those in which both monosyllables and 
polysyllables are possible; 

(b) a polysyllablic language which has the centre of stress on a 
certain syllable; or one in which each syllable is of equal value. 

(c) each syllable is pronounced separately or successively. 

(iii) Kinds of Accent: 

(a) fixed accent, fiexible accent; 

(b) pitch accent, stress accent; 

.(c) in the case of the stress or pitch accent, the initial syllable is 
stressed or high-pitched ; the final syllable is stronger in stress or higher in 
pitch. 

With a language that has a stress accent, the distribution of stress 
over a word once determined, the pitch may freely vary. On the other 
hand, with one that has a pitch accent it is the stress that may freely change 
while the pitch remains constant. But in such cases, how does the differ- 
ence in accent appear in the resulting speech-sound? Or how does it come 
to be when the accent interferes with the tone (for instance, when at the 
close of an interrogative sentence a word comes that ends in a syllable 
with a low accent) ? The answers to these questions will gradually 
come to be understood in the course of the following explanation. 

As has already been explained, speech is made up of the five elements, 
i.e. speech material, usage, contents, style and tone. A study will be made 
hereunder of the tonal peculiarities of nine languages, viz. English, Ger- 
man, French, Japanese, Chinese, Korean, Hindustani, Russian and Mon- 
golian, chiefly concerning the first two of these five elements. 

III. 3. English. 

The three English descriptive sentences used as data are given below. 
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The speaker of the first is Sir Francis Lindley, former British Am- 
bassador to Japan, and of the second and third, Mr. W. E. Harris, 
sometime Lecturer in English in the Tokyo School of Foreign Lan- 
guages. 

1. It has been often said that books do for us to-day what universities did 
in earlier ages The knowledge that could, five centuries, ago, have been ob- 
tained only from the lips of a teacher can now be gathered from, the printed page. 

2. I wiU give you a red flower. 

3. I will give you a flower which 1 plucked on the hill at the back. 

Fig. 27. on the next page has been obtained, after a strict measure- 
ment, by the representation in curves of the pitch and stress as registered 
on an oscillogram. 

(a) Sentence 1. will first be treated. 

As has already been explained, in a dissyllable uttered by itself, the 
pitch interval of the twb syllables is a fifth on an average with an average 
stress ratio of 2 6 ; the figures vai-ying but little with different words. The 
ratio of stress, however, undergoes modification in connected speech by the 
intonation. Speaking generally, the order of stress is not normally sub- 
ject to change, even if the ratio of stress varies among syllables of a word. 
The change in the order of stress takes place now and then in a poly- 
syllable with almost level accent. (A wonderful example is everybody.) 
The ratio of stress between the strongest and the next strongest syl- 
lables in the words of two and more than two syllables of sentence 1 
are given in parentheses as follows : 

often (3.6), to-day (5), universities (2), earlier (1), ages (12), 

knowledge (2.B), centuries (6), ago (5), obtained (4), only (3), 

teacher (1), gathered (2), printed (1), page (-). 

Probably, it is due not so much to the intonation as to the intrinsic 
quality of the word itself that the stress ratio varies so much with dif- 
ferent words. The stress of flower in sentences 2 and 8, however, 
varies considerably according to position. 

(b) The words it, that, the, a, from, etc. are weaker than others. 

(c) As is evident from the diagram, by piecing together the fragment- 
ary small curves, one continuous and moderately undulating wave-line 
will appear. The resulting pitch-curve goes on quite smoothly, almost 
independently of the stress. This constitutes one of the characteristic 
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Fig. 27—1. 
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Fig. 27—4. 

features of the English language. In this sentence, books and could 
alone are not on the curve. Every bit of the wave-line is complete in 
meaning. At the points of special importance in the sentence, the 
curve makes greater undulations and toward the end it either becomes 
of level tone {did in earlier ages) or shows a succession of short falling 
tones {teacher, printed, page). 

(d) By dividing the wave-line shown above into several breath- 
groups, a study will be made of the pitch and stress in each group. 

First of all, in (1) It has been often said, the pitch is comparatively 
high from the beginning and reaches its highest point with often, and 
after that begins to descend ending in said. The last named is pro- 
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nounced with an ordinary pitch. The stress too is, on the whole, strong. 
In (2) that books do for ns to-day, both pitch and stress decrease. But 
after that comes hooks (strong and high) which is followed in turn by 
do for us to-day (less high-pitched) after an appreciable lapse of time. 
Little variation is observed in pitch in (3) what universities. The pitch 
is lower still in (4) did in earlier ages. The change in pitch may be 
illustrated briefly as follows : 



4 


Fig. 28. 




The pitch is higher in (1) and books of (2). The close of the sen- 
tence is indicated by the fact that (4) is lower in pitch than (3). The 
stress remains fairly strong throughout (1); in (2), books is promi- 
nently stressed, and (3) and (4) are both weaker. In (5) The know- 
ledge that could five centuries ago, knowledge is stressed and high-pitched 
being at a peak of the pitch. CouU and five centuries ago have peaks of 
their own, that of the latter being situated between five and centuries. 

As a whole, (6) is high. The change in pitch is scarcely perceptible in 
(6) have been obtained only from lips of a teacher. In (7) can now be 
gathered from the printed page, can now be makes a' rise with now at the 
height. The pitch is ordinary in gathered from the printed page, so al- 
so is the stress. (5), (6) and (7) may be summed up in the following 
diagram drawn to scale. 

|» e 7 



Fig. 29. 


In sentences 2 and 3 the same relations exist between pitch and stress 
as in 1. But in 3, the wave is of the shorter cycle than in 1. 


III. 4. German. 

(Materials for the experiment are from the nine sentences given 
on page 42, the speaker being Mr. M. Netke, Lecturer in the Tokyo 
School of Foreign Languages.) 
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Fig. 80—1. 
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Fig. SO — 4. 
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Fig. 30 — 6. 

1. Abends ivaren (die matt. 

2. Jeder esse Erdbeeren. 

3. Ihn liebte jie mnig. 

4. Otto holte Obst vom Onkel. 

5. Der Vhu ruft unheimlich vm Dunkeln. 

6. Die aTmlichen Mahren sahen szch lacherltch ahnlich. 

7. Ich horte ihn norgelnd die bfen oftnen. 

8. Vber den Hugel tanzten die Mucken 

9. Et brachte die lynschen Hymnen nach Typen ein System. 

The pitch and stress as recorded on the oscillogram are represented by 
curves m Fig. 30 (1 — 5) on the preceding pages. (The figures in paren- 
theses given below stand for the numbers of the above sentences) . 

From this, it may be noted that most dissyllables (numbering 22) in 
these short sentences are stressed on the first syllable. In pitch, they 
are either higher initially or level. In both mnig (3) and holte (4), 
the two sjdlables have equal stress, and the pitch is higher on the initial 
syllable. On the other hand, in leder (2), liebte (3), iiber (8), Hugel 
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(8), the initial syllable is weaker, higher and longer: viz. the so-called 
“betonte Silbe” with a close vowel is weakened. It seems that with 
these words, the initial syllable can be pronounced more strongly but 
lower, and the last more weakly yet higher without the proper accent 
of the word being much impaired. But it is not permissible to utter the 
word in such a way as to make the final syllable both higher and 
stronger. Accordingly, in this case the pitch can be or has been sub- 
stituted for the stress. 

The foregoing German sentences have been chosen with a view to the 
analytic measurement of the German vowels. Correspondingly in the 
following will be treated the pitch and stress as registered on the osillo- 
gram of the materials gathered this time for the purpose of studying 
intonation (the speaker being the same as in the former). 

1’. Horten Sie gestem abend den Vhu ra/en’ 

2' Hein, leh habe nichts gehort 

S’. Ja, waren Sie denn gestem nicht hier') 

4’ Ja, ich bm nicht fort gewesen 

5’. Nein^ Und Sie horten nicht den dunkeln RuP 

6’. Ja, denn habe tch gehort 

7’, Ja, das war der Uhu 

In the above sentences, each of the words horten, gestem, hade, gehSrt, 
Uhu occurs twice. (The pitch and stress curves of the sentences have 
been omitted.) Except in the case of Uhu, they are very much like one 
another in their stress-curves; the pitch-curve is level with habe, and 
with gehort, it falls in ge- and has a rise-fall in -hdrt. In each case, 
the peculiarity is comparatively well retained. With gestem, however, 
it is a little different. The word is high pitched on the first syllable in 
the sentence 1’ and is low pitched on the last in 5'. The curve, in 
each case, has a rise in the forepart and a fall in the final portion. 
Consequently, the accent of German may be described as “stress 
accent”, and is more immobile as compared with that of the English 
language. 

The ratio of stress in the dissyllables and trisyllables contained in the 
foregoing German sentences will be given below in order to show the im- 
mutability of stress accent (the ratio for the trisyllables being that be- 
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tween the strongest syllable and next strongest). 

1. abends (2 7), waren (2 3), alle (13). 

2. jeder ( - ) , esse (1.3) , Erdbeeren (1 3) 

3. hebte ( - ) , mnig (1.2) . 

4. Otto (21), holte (11), Onkel (43). 

5. Vhu (10), unheimitch (12), Dunkeln (2 3). » 

6. armliohen (IS), Mahren (19), sahen (2 0), ahnlich (2.0), lacherlich 

(13 15 6). 

7. Norgclnd (2 6), Oten (3 0), ognen (3.0) 

8. iiber ( - ) , Hugel ( - ) , tansten (3 2) , Miicken (2.9) . 

9. Tjrachte (3 7), lyrischen (14.9:6), (1.3), Typen (-), System (1.4). 

1’. horten (5 0), gestern (16), abend (16), Uhu (1.5), nifen (2.0). 

2’. Ao6e (2 0), sreftorf (1.4). 

3’. waren (1 9) , gestern (1 8) . 

4' gewesen (2 0) . 

5’ horten (4 6) , dunkeln (1 1) . 

6’. habe (1 7) , gehort (1 4) . 

7’. Vhu (2 6). 

Of the ratios given above, the greatest is 5.0 and the smallest 1.0, the 
average being 2.1. The last figure is larger than the average of 1.4 
for the words said in isolation. The difEerence is due not so much to the 
variations in intonation as to the diifering qualities of the words. A 
glance at the pitch-curve of German tells us that a sudden fall in pitch is 
immediately followed by a sudden rise. When repeated in a sentence, this 
gives rise to a kind of rhythm. Sentence 4 furnishes a typical example. 
This peculiarity perhaps results from the fact that the German words are 
so pronounced as to bear stress on the first syllable. That such a char- 
acteristic feature is in accordance with the German usage is clear from 
the existence of a similar phenomenon in the monosyllables. Examples: 
ikn liebte (3), hebte sie (3), Obst vom (4), ihn nS-'gelnd (7), Hymnen 
7iac?i(9), Typen ein (9). 

In English, the pitch-wave is either of long or short cycle, and is com- 
plete in itself or becomes a small centre in a sentence. The case is 
slightly different with German rhythm. Both in English and German, 
the pitch does not rise toward the end of a sentence (when the sentence 
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is descriptive), but is in general level or falling. Again, in English, the 
stress is diminished rather quickly in final positions of words, while in 
German both the pitch and stress decrease more gradually. 

III. 5. French. 

The following five sentences have been chosen as material for the de- 
scriptive style of French speech. The speakers are Messrs M. F. Nouet, 
M. M. Chabas, and M. Mott for 1, 2; 3, 4; and 5 respectively. 

1. Papa va a Paris. 

2. he fer a ete ehauffe dans le layer. 

3. On a hattu un maim pour avoir eouru dans les fleurs des deux corbeilles. 

4. La France fut un royaume dd mcnarchie absolue jusgu'd la rivolutUm de 
1789 (.dixsept cent guatre-vingt-neuf) pour etre aujourd'hui une Rd- 
publique 

5. La misere d’un pauvre enfant Cest dans cette dtrotte et sombre rue de 
Pirigueux, ou nous demeurions alors, gue je devais eprouver Virreparable 
malheur de ma vie. 

The pitch and stress of these sentences as recorded on the oscillogram 
are represented by curves on the following pages: 

a) First of all, the stress-curve will be studied. In a group (consist- 
ing) of several syllables, the difference in stress is only small between 
neighbouring syllables. When it is great, that denotes a considerable 
space of time intervening. By linking together the peaks of the stress- 
curves of the syllables, a smooth curve may be obtained. A part of the 
stress-curve of fusqu’a la t evolution is illustrated in Fig. 32. 

The curve takes various kinds of forms besides those of mountain shape 
given in the diagram. These are either level or have a gradual rise or 
fall. In Fig. 33 are illustrated the stress-curves of sentences 3, 4, 6 
and their curves obtained by piecing together and getting rid of ir- 
regular indentations of the smaller fragmentary curves. 

b) French is said to be a language with “stress accent” where the final 
syllable is stressed more than any others. In connection with this, inter- 
esting phenomena have been noted. French has a peculiar system of tone : 
the pitch element of words forming a sentence is determined according to 
their position in the sentence (by this is not meant, however, that the 
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Fig. SI— 3. 
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Fig. 31 — 4. 






Fig. 32. 





Figr. 33. 

stress is free from such restrictions.) For instance, when the subject 
beginning a sentence is either a word of more than two syllables or pre- 
ceded by an article, it (e.g. /er, Fmnce), or its final syllable (e.g. -pa, 
-8^re) becomes higher as in papa (1), le fer (2), la France (4), la mis'ere 
(6), etc. Again, when a continuation is implied as in on a battu (3), un 
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mcMn (3), the pitch-curve has a terminal rise — a feature also charac- 
teristic of the tone system of the language. 

As has been previously explained in the section on sound psychology, 
the rise in pitch is heard more prominently when the pitch and stress 
run inversely. It is perhaps for this reason that the last syllable of 
words in a sentence with rising tone becomes higher, weaker a*nd 
longer. To draw special attention to this fact the words are under- 
scored in Fig. 33. (&attu(3), matin (3), avoir (3) absolue (4) aujourd’- 
hui (4)). 

Even the last syllable of words with rising tone may sometimes be 
stressed more than a preceding syllable, provided that it ends in a long 
vowel. For example, in Pingueiix, -gueux is stronger than Peri-. But 
as far as -gueux is concerned, it becomes not only higher but weaker. 
When a sentence is approximately level in tone and the last syllable 
of the words is not very high-pitched, the two syllables are almost equal 
in stress. The last syllable becomes stronger as in couru (3) , and ktroite 

(5) , when the sentence is of level or falling tone. 

Cases in which the pitch varies in inverse proportion to the stress are 
not confined to the above examples, as is instanced in the syllables that 
together make up a word. Examples : cor-, -gueux in corbeilles (3) , Peri- 
gueux (5) ; the trisyllabic livolu- in revolution. 

But at the same time the pitch and stress often vary not inversely. In 
royaume (4), both pitch and stress decrease and similarly monarchie is 
stressed on the first syllable with -chie being slightly higher. In demeur- 
ions and alors in the sentence 5, the stress remains unchanged and the 
pitch is level in the former and rising in the latter. Re-, -tion in revolu- 
tion (4) are both stressed and the word as a whole has a rising tone, while 
in jusqii'h, the final syllable is stronger and higher. In misire, eprouver 

(6) , the pitch and stress increase in the final syllable; and in malheur (6) 
the corresponding syllable is higher though not strong. The pitch and 
stress are in proportion in these three words, for they are located in a sen- 
tence in those positions in which the rise does not take place. 

c) In English and German, languages of the stress accent type, the 
pitch and stress decrease only by degrees when the sentence ends in a 
word with weak stress on the last syllable. In French, on the contrary. 
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as is shown in Paris {1), foyer (2), corbeilles (3), etc., the pitch falls only 
after having once risen. And the stress is weaker on the last syllable 
which is longer instead. 

These observations reveal that French is different in character from 
English and German. Whether French with such peculiarities is a pitch 
accent or stress accent language, or belongs to neither of the two catego- 
ries will be dealt with in the next chapter. 

III. 6. Japanese. 

Of the following sixteen Japanese sentences used as data, the first 
twelve are descriptive. 

1. £ 0 kaku; Enogu o tsiikau 

(to draw a picture; to use colours.) 

2. Shojikl na hito da 

(He is an honest man.) 

3. Eigo mo manaitmasu 

(I learn English as well.) 

4. Kasukani mteru. 

(I can see it vaguely.) 

5. Seikatsu no tamem. 

(for the sake of living.) 

6. Seitoi ga koe no moto desu 

(The vocal cords are the source of the voice.) 

7. Baku wa mareigo no gakusei desu 

(I am a student of Malay.) 

8. Baku wa Kakei-kun no sdbetsukai nt shusseki shimasu. 

(I shall come to a farewell meeting to Mr. Kakei.) 

8. Kind Meijijingii ni sanpai slumashita 

(Yesterday, I went to the Meiji Shrine to pray.) 

10. Ano htto wa nyugakushiken ni shubiyokv} kyudai suru rashti. 

(He is very likely to pass in the entrance examination with great 
success.) 

11. Anata ni akai hana o agemashd. 

(I shall give you a red flower.) 

12. Anata ni kesa urayama de totta hana o agemmhd. 

(I shall give you a flower which I plucked on the hill behind the 
house.) 
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The following sentences are emphatic with the exception of No. 16, 
which is interrogative. 

13. Hanaga kiretda. 

(The flower is pretty.) 

14. Kiretnanowa hanada. 

(Pretty is a flower.) or (A flower is a pretty thing.) • 

15. Sotsugyoshiki o shikusu. 

(I congratulate you on your graduation ceremony.) 

16. Horn? Aki? 

(Spring ? Autumn ? ) 

The descriptive sentences (1-12) have similar pitch and stress curves. 
The pitch-stress-time curves of Nos. 6, 7, 8, lO and 12 may be given, 
as these sentences have more or less characteristic features and variant 
types of accent. 

(A) The descriptive style of Japanese speech has a very simple form, 
and as is shown m the diagram on pp. 56-57, the stress gradually de- 
creases from the beginning to the end or remains almost the same. 
Throughout the sentence, there may be observed only a slight change 
in its tone (the partial average of pitch). In Fig. 34. are illustrated dis- 
syllables and polysyllables such as koe, boku, hana, Mareigo, gakusei, 
Kakei, asbetsukai, shiisseki, anohito, nyugakushiken, shiibiyoku, kyudai, 
anata, kesa, urayama- etc. Little difference, however, is perceptible in 
pitch variation, as characterised by accent 

(B) A few words have previously been given as examples to show 
characteristic features of the Japanese language. Further investigation 
of other speech material, i.e. of words pronounced separately or 
in connected speech, reveals a peculiarity unknown to any lan- 
guage with stress accent. In order to represent accent in Japa- 
nese*, it has been customary to draw lines (in vertical passages) 
on both sides of or above and below (when written literally) a 
letter. In this way the relative pitch between syllables may be 
known. It should be noted, however, that this is purely on the 
ground of convenience and that we have here only a subjective 

* See Dr K Sakuma “Pronunciation of Japanese,” p. 408; "Pronunciation 
and Accent of Japanese,” p 161. 
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representation of pitch. In fact, this system of representation is not 
so designed as to denote that the pitch varies in gradation. This is 
because in most Japanese words, the pitch shifts from one syllable to 
another, either quite smoothly as in obasan or quite abruptly as in uni. 
In order to make these points clear, several examples will be given 
which have been classified according to the shape of the curve. (The 
figures in parentheses given below stand for the diagram numbers of 
the Japanese sentences.) 

(a) The pitch varies smoothly. 

(1) The pitch remains almost constant. 

seitai (34), sanpai (#p) (34), taiie (B®) 

In seitai and sanpai, the pitch is not uniform throughout the word; only 
the partial average of the pitch of each syllable is equal. It goes without 
saying that some syllables have a slight rise while others have a slight fall. 


(2) The pitch slightly rises. 


Rising Interval 
(Ratio of Pitch) 


tonbo ( tl t^) 1.07 

morau (S A ) 1-06 

kanemoehi (&^) 1.10 


(3) The pitch gently falls. 

Falling Interval 
(Ratio of Pitch) 


bs 1.16 

koe (S) 1.12 

karasu !J§) 1.06 

kirei (SSffi) 1-12 

soramame ( 5 c, i ® ) 1.20 

ksrogi (jg 1.30 

tsgan (<5- W.) 1.45 

eiga (KS) 1.90 


As is shown m Fig. 34, the pitch-curve of koe is level initially and then 
falls in an almost straight line. In soramame and eiga, the pitch falls 
after having risen. The former is said to be of the type “low-high-mid- 
mid”, in which the initial so syllable is higher m pitch than the final me. 
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Fig. 34—1. 
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(4) The pitch rises and falls. 

Rising Interval FaUing Interval 

(Ratio of pitch) (Ratio of pitch) 

b<m (&) 1.06 1.06 

nyiigakushikeni 1.20 1.20 

ohasan (felfebS/i) 1-57 1.83 

niisan 1-B4 2.20 

(b) The pitch rises somewhat abruptly. 

Rising Interval 
(Ratio of pitch) 

eo 1-46 

eigo (3E^) 1.42 

These words, as distinguished from those mentioned below are felt to be 
not so strong considering the greatness of the interval. This is because 
the pitch rises in them in proportion to the stress and not so abruptly, 

(c) The pitch rises suddenly and then slightly falls. 

The rising intervals of the pitch and the interval between two syllables 
at the points of the highest pitch are given as follows : 



Rising Interval Interval between 

(Ratio of pitch) Syllables 

aki (f>0 

1.20 

— 

me (IS) 

■ 1.08 

1.06 

uni (Sfl-) 

1.13 

1.15 

at (^) 

1.10 

1.08 

The pitch 

of two syllables goes on in gradation. 


Interval between Syllables 
(Ratio of pitch) 

moto 

1.07 


hata (.M) 

1.09 


sake (19) 

1.15 



1.18 


aki 

1.18 


hato (Ml) 

1.09 


sake (£) 

1.09 


mochUM) 

— 


aki (IM) 

— 
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In the last three, the pitch rises abruptly on the first syllable as in (c) , 
(e) The pitch falls or is level initially, suddenly rises and then be- 
comes level. (This is principally what Dr. Sakuma calls “undulation 
type”.) 

Rising Interval 
(Ratio of pitch) 


obasan 

1.10 

sugu (to 

1.23 

aozora (WS) 

1.20 

shubiyoku Cg'lii < '> 

1.20 

sakujitsu (BfH . 

1.25 


On the syllable a of aozora and the syllable o in obaaan.ihe pitch falls, 
and at the same time the voice becomes strong ; accordingly the highest 
pitch is not felt so high. On the other hand, the points of highest pitch 
and maximum stress coincide in sait of obasan. The impression of pitch 
produced by the syllable, therefore, corresponds not to that of the 
average pitch but to that at the point of highest pitch (at the same time 
the point of maximum stress). 

(f) fro has a sudden fall in pitch on the syllable ro, the falling interval 
being 1.24. 

The above examples show that the interval ranges chiefly from 1.06 to 
1.25 (i.e. minor second — major third) in words consisting solely of short 
vowels, and that in those formed by long vowels alone it varies mostly 
between 1.3 and 2.0 (i.e. fourth-octave). The interval varies greatly 
from individual to individual and also according to the intrinsic quality of 
the word. In a word with a greater interval, the pitch varies proportion- 
ally with the stress. Consequently we have no such impression that the 
pitch alone is changing widely. 

Besides those above-mentioned, the remainder are mostly combinations 
of the various forms of pitch and stress. For example, the difference 
between Kakei (name of a person) and kakei (household account) in both 
of which high and low pitches and rising and falling tones are combined, 
lies in that the second syllable is either of rising tone or of falling tone. 
The interval between ka and kei at the points of highest pitch is the same. 

(C) The construction of Japanese syllables is entirely different from 
that of English, German and French syllables. In these European Ian- 
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guages, there are long syllables as well as short ones which are uttered 
with varieties in speed and are more or less isolated. Japanese syllables, 
on the contrary, are linked up with one another, each consisting of a short 
vowel preceded by a consonant. The connection is never broken except 
when it is forced to be to facilitate pronunciation. The Japanese sentence 
No. 7 given in Fig. 34. is perhaps a typical example. The curve' in 
Fig. 34. is broken here and there. This is because the vowel becomes 
voiceless as in hi and ku or there intervenes a fricative with a small 
amplitude such as [sj and [J], 

(D) In order to see if the accent of a word undergoes modification 
by emphasis, research has been made into the relations between 
pitch and stress of the sentences Hanaga kireida {ham, is given 
emphasis), and Kireimnowa hamda (Kirei emphasized). As is 
noticed in Fig. 35., the pitch-curve at the junction of the hana 
and ga in No. 13 is identical with that at the junction of the non- 
emphasized hana and da in No. 14. The accent, therefore, undergoes no 
modification by emphasis. In ham of No. 13, the na syllable that is at the 
“peak of accent” is stronger than ha that is not and the subsequent aux- 
iliary particle ga. But in hana of No. 14, na is not much stronger than 
other syllables. The same is the case with kirei. From this we learn 
that emphasis depends upon stress in languages with pitch accent such as 
Japanese. When, however, an emphasized word or phrase stands apart 
from other words or phrases and there exists no accentual relation be- 
tween them, the pitch of the word or phrase can be raised as it is, without 
the shape of the pitch-curve being much altered. The shukusu in sen- 
tence No. 15, Sotsiigyashiki o shukusu, is an example of this. 

In the sentence Hanaga kireida, ham has the highest pitch and the ef- 
fect of emphasis is obtained by producing the syllables stronger; but in the 
case of ktrei the emphatic effect is not made merely by stronger pronuncia- 
tion, for the word is lower in pitch than hana. In such a case a sentence 
is employed with a suitable tone that contains a special word. Examples ; 
Hamga kireidane ; Hamga hijoni kireida. 

(E) When a sentence closes with a word the final syllable of which is 
low-pitched, how does this affect the pitch accent? To solve this question, 
a study has been made on the oscillogram of sentence No. 16 Harul ATal 
As noticed in Fig. 35-2, in Haml the pitch falls somewhere between 
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ha and ru and then rises. In this case, a strong stress falls on the first 
syllable, and the stress decreases toward the end. The ascent of pitch is 
made prominent since the pitch varies in inverse ratio to the stress. Ru 
is much longer than in a descriptive sentence. In normal pronunciation, 
ki in afci usually turns to a voiceless sound, but in a question it retains the 
full value of the voiced sound, A and kt are almost equal in the average 
pitch; in a, however, there is a sudden rise in pitch so that this syllable is 
felt to be stronger than ki as is instanced in itni. Furthermore, ki is 
weaker and a little longer than the preceding a; and a slight rise in pitch 
in the syllable ki shows that the word is used interrogatively. By these 
features the word can be distinguished from aki in Fig. 10. Analogous 
examples are numerous in Japanese, but detailed investigations with re- 
gard to them will be made on some future occasion. 

III. 7. Chinese. 

As material for the study of Chinese, the following three expressions 
and four sentences have been used, the speaker being Mr. Chai Shou 
Chih. 


1. 

M,. 

[ti‘-fau'] 

2. 

M °m 

r t;au’-iuo= ] 

3. 


°B5 [tjuu'-hua'-min'-kuo'] 

4. 

w 

IT .S ^, ? [ nr-jQ 0 '-na’-er-t'?y* ] 

5. 

-b. 

fc [ Jau'-9ye“-?iau*-t‘5y‘ ] 

6. 

11° 

°# M ."I! ? [ ni’-nien‘-j3n'-mo'-Ju'-na‘ ] 

7. 

?lc° 

,1115 t uo’-nien'-kuo^-y -na* ] 
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Fig. 36—2. 


It has already been stated that there are four kinds of accent in Chinese 
and that they are represented either by the symbol [o] in different 
positions or by figures. The sign [— ] stands for emphasis. (The em- 
phatic sign and the symbols of accent have been supplied by the 
speaker himself and a Japanese teacher of Chinese). 

(a) In Japanese polysyllables, the length of the syllable is extremely 
short and the pitch is subject to restriction only at the junction between 
syllables and is otherwise free to rise or fall either at the beginning or 
at the end of a word. Accordingly when at the end of an interrogative 
sentence comes a word that is low-pitched finally, the pitch falls where 
the penultimate is joined on to the last syllable and then rises again to 
show the sentence is interrogative. The pitch at both ends of the word 
is, therefore, free to rise or fall unless the accent undergoes modifica- 
tion. 

In Chinese, on the other hand, each word has a stereotyped rise and 
fall, as it were. When the word becomes one unit in the formation of 
connected speech, how is such an immutability in pitch affected? A 
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few words will be said' on this question. 

When two “Jou Joo” (syllables of level-rising tone) occur consecu- 
tively, the first is altered to “fia p'iu” (abrupt rising tone), according 
to the rules of change in “a Jao”. There seem to be no other rules and 
in the case of a compound consisting of two or more than two words, the 
accent of each element remains the same as when it stands by itself. 

(b) As shown in the diagram, [ti'fao']and [tjou’luo®] furnish exam- 
ples of dissyllables. In the former, the initial syllable has a fall and the 
final is level. This shows that the accent is by no means modified by the 
union of the syllables. On the other hand, in [t/oo-luol.E luo] that is pre- 
ceded by ttjau] is uttered on a falling tone when it should be on a rapid 
rising tone. This is contradictory to what has been said before and may 
be due to the fact that the speaker has employed a dialectal variety of 
speech. In the quadrisyUable [ tJuu'-hua’-min'-kuo“ ], the first syllable is 
uttered on a level pitch and the last three on a rapid rising pitch. [ tjuo 1 
and [kuo] , therefore, as illustrated in the diagram, are not deprived of 
their original features in accent. The pitch rises in [ hua ] and [ min ], 
but their pitch intervals are smaller than the rising interval of about a 
fifth in [t?i’J and [fa“] uttered in isolation to study “s Jau”. 

Except where syllables are so combined as to ensure easy utterance, 
e.g. in such a dissyllable as is uttered on a level pitch initially when there 
IS a rise or fall on the final syllable, or on a rising pitch with a final fall, 
and vice versa, the rising interval is naturally small as when syllables are 
uttered on a rising sequence inasmuch as they are closely connected to 
form a new phrase. 

(c) In sentence No. 4, [ m'-Jau'-na’-er-t'?y ] the auxiliary particle [ -er ] 
is combined with tlie preceding [na’] to create a syllable. As shown in the 
diagram, the rise takes place on [ni ] and [/au-na-er]is uttered on a fall- 
ing sequence. The pitch then ascends toward a higher note and the fall 
takes place in [ t'qy ]. That the curve for [na-er ] is of such a shape as is 
shown in the illustration is due to the particular form of accent of the 
syllable. In sentence No. 4, in consequence, the accent of words retains 
its proper form and all the syllables have the stress of almost the same 
degree. 

In sentence No. 5 [/au‘-?ye*-siau'-t‘?y’],rjao] is uttered with a falling 
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tone, [sye] with a rising pitch, [pou] having a falling tone. Their 
rising and falling intervals are the same as when in isolation. The fall 
should take place in the final [t'gy], but actually the pitch varies only 
a little and the stress is rather weak. 

Sentences No. 4 and No. 6 have been used to see whether there is 
any difference to be noted according as the same [t‘?y] with falling tone 
is at the end of an interrogative or a descriptive sentence. In No. 4, 
the above-mentioned syllable has the stress of nearly the same degree 
as any other and has rather a high pitch which is quite natural. The 
stress IS extremely weak, however, in the same syllable in No. 5, the 
pitch being rather low. It may be observed that the accent undergoes 
alteration in the latter case. Unlike other languages, the difference 
between the Chinese interrogative and descriptive sentences lies in the 
fact that in the former the pitch can be raised en masse while 
maintaining the falling tone and simultaneously the status quo, so to 
speak, of the stress. In the sentences No. 6 [ni'-nien‘-/3n'-mo'-Ju‘-na'] and 
No. 7 [ no’-nien*-kuo’-y*-na’ 1 the [ na' ] is level in accent, but in No. 6 the 
pitch is rather high on the same syllable, being a continuation of the high 
pitch of the preceding [ /u ] ; and the stress is about the same as on other 
syllables. The accent of the syllable is, therefore, quite normal. In No. 
7, [na] is weak and has a lower pitch than the preceding syllable, but 
the pitch rises instead of being level. The syllables [ni], [nien] and 
[Jan] are pronounced respectively with rising, falling and rising tone, 
so that in each case the accent is quite regular. And yet [ni'*] and [Jan’] 
can clearly be distinguished, although they are alike in having a rising 
tone. Again the accent is quite normal in the syllables [ mo], [ Ju ], [na] 
with level tone. [ uo ] , [ nien J and [ kuo ] in No. 7 are also natural ; 
but in [ y’ ] the pitch continues to rise for about 2/50 see. and then falls 
— this appears to be not in accordance with the normal pitch of [ y’ ]. 
Instead, the stress is remarkably on the increase. 

(d) In Nos. 3 and 6, no great divergence in the degree of stress of the 
words is perceptible, but in No. 4, [kuo] is stronger and in No. 6 [^ye- 
5 iau] both stronger and higher. The difference in stress is almost negli- 
gible in No. 6, but in No. 7, [ kuo-y ] is stronger and higher. 

It IS customary in Chinese to pronounce the principal word of a phrase 
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with as exact an accent as possible and to give special force to its tone 
BO that it may stand out from the rest of the phrase. This is called in 
Chinese “tjung nien” or fiau ” of which the explanatory ex- 

amples are given above. 

The stress-curve forms a trapezoid, both ends of which sometimes 
jut out. All this is peculiar to “t/uu nien”. For example, the stress- 
curve of [ ti-fau ] in the sentence No. 1 is characteristic of Chinese and 
unknown to any other language. Besides long vowels, diphthongs, 
triphthongs and nasal vowels are found in Chinese. The reason is this : 
inasmuch as any one single syllable forms a word in Chinese, the syllable 
(naturally) tends to be pronounced longer than otherwise so that there 
may be more varieties of sound and the accent may be more prominent — 
a feature that is unmistakably characteristic of the language. 

III. 8. Korean. 

The following six sentences have been treated as material for the study 
of accent in Korean, the speaker being Mr. Chang In-Sap, Professor in 
Engi College, Seoul. 

1- Wso son gjau sou dsau in sa^,] 

2. ^ n^al bab, man tha] 

(There is plenty of boiled rice.) 

3. a ^ 4 [kh+ n ga^i ?dgidi 

(I put on a big strawhat.) 

4. cf tV Of # C|> [ba da ga ?da ‘■'df di ha da] 

(The sea is warm.) 

6. c) 0] C] 7l- ? [na a di ga] 

(Where are you going?) 

6- 1± ^ 2f- [nan mol la] 

(I don’t know.) 

(The symbols in parentheses are those used for phonetic transcrip- 
tion of Korean speech in Korea). 
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Fig. 37—1. 






TONAlr PECTJUAHITIEa OP VAHIOUS IJINGUAGES 


69 



Fig. 37—2. 

In Korean, as in European languages, innumerable sounds are created 
by the combination of vowels and consonants. For instance, the initial 
letter 2 [dfo] of sentence No.l has been formed by adding the vowel 
j. [o] to the consonant ^ [^g]. In like manner, as ^ stands for [ s ], 
for [a], and ,_for [n], the three make {d [san]. 

(a) Korean consists of the elements native to the language, and various 
idiomatic phrases adopted from Chinese. It bears a close resemblance to 
Japanese in the use of auxiliary particles and in word order. The two 
languages, however, differ greatly in their phonemes (both vowels and 
consonants) . A comparison of two languages which resemble each other 
grammatically, and yet differ phonetically is very interesting in connec- 
tion with the problem of accent. 

(b) The stress of sound will first be dealt with. 
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The stress increases in No. 1 and decreases in No. 4 as it approaches 
the end. In Nos. 2, 3 and 5, the stress remains almost constant through- 
out the sentence, but the final .syllable is especzally stressed in No. 6. 
Of course, with these materials in hand we cannot hasten to lay down 
any definite rule as to stress. 

(c) A study will now be made on pitch. Sentence N6. 1 does not sup- 
ply very significant data. It consists of the dissyllables [d^o son] and 
[gjao sau] and the trisyllable [^335 in sab,] pronounced in succession, 
[san] in the first dissyllable is higher than [^30], and [sag] in the 
second is uttered on a higher pitch than [ gpo ]. Again, [ dsau ],[ in ] and 
[sab,] are similarly pitched, which is indicative of pitch accent. 

In the Korean dissyllabic words illustrated in previous section, the 
first syllable has a rise or rise-fall and the second a falling pitch and there 
is a difference in pitch between the two syllables (Pig. 16). In this con- 
nection, it has been pointed out that Korean is capable of being classed 
under the group with pitch accent. Analogous examples of this type of 
accent may also be seen here. Namely, of [’sal bah,] in No. 2, [kh4n ga^i] 
in No. 3, and [ha da] m No 4, the first is high-pitched on the final syl- 
lable and the last two on the initial. The reason why the second syllable 
of [Isalbalj,] and [khtn gad,], unlike that of the isolated [d^ig ■’gun] and 
[an ?(]3a] as mentioned in chapter II, is not uttered on a normal falling 
tone IS this : the syllable begins with a voiced consonant so that it links in 
pronunciation with the preceding syllable. 

(d) In the descriptive sentences Nos 2, 3 and 4, the pitch uniformly 
falls sharply when the end of the sentence is reached. Although No. 6 is 
interrogative, a remarkable rise takes place finally (to judge from other 
examples, this is presumably due to the manner of pronunciation). Being 
declamatory. No. 6 is especially stressed in the end of the sentence. We 
shall perhaps have a similar phenomenon, if a Japanese sentence with the 
same meaning as this is pronounced. 

III. 9. Hindustani. 

The following thirteen sentences spoken with standard Delhi accent 
have been used as material for Hindustani, the speaker being Mr. 
Barlas. 
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1 . man aid; subah sa:t badse: uthkar madarse: gaja: 

(This morning I got up at 7 o’clock and went to school.) 

2. doipaher ke bad main waihd saihab ke madarse: aiia 

(In the afternoon I went to my father’s school.) 

3. jahain profesaron se bait t/iif Id: 

(There I had a talk with the professors.) 

4. kaim ke hatam home: par ma n ghar dsai-umga: 

(After finishing up the work, I shall go home.) 
aur tjhe badge: khaina: khai-uinga: 

(and at 6 I shall take dinner.) 

5. phir dm ka: sabaq jaid kame: ke baud main sd: dgai-uinga: 

(Then after revising the day’s lessons I shall go to sleep.) 

Conversational style: 

6. kjai aip djaipaini: khaina: khai sakte: ham? 

(Can you eat Japanese meals?) 

7. dgi: hain mam fhaira kha: saktai hum 

(Yes, there are some dishes that I enjoy.) 

8. dgaipami: aibahawai kaisii maaluim haiti: hai ? 

(What do you think of the Japanese climate?) 

9. garmi: zijaidah taklnf nahiin de:^ leikm dga're: meiQ sardi: sah( 
mkliif deitii hai 

(The heat does not trouble me, but I suffer from the cold in winter.) 
Emphatic sentences: 

10. je hii meirii kitaib hai (“je” emphasized) 

{This is my book.) 

11. jih meirii hi: kitaib hai ("meirii” emphasized) 

(This is my book.) 

12. jih meiri: kitaib hai 

(This is my book.) 

13. kis ns us ks mairdulai? 

(Who killed him?) 

Of the above thirteen sentences. Nos. 2, 6, 6, and 7 have been excluded, 
for they bear a close resemblance to Nos. 1, 4, 8 and 9 respectively. The 
diagrams of curves in which are shown the relations between pitch and 
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Fig. 38—2. 
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stress in the remaining sentences (Nos. 1, 3, 4, 8, 9, 10, 11, 12 and 13) 
will be used as data for an examination of the accent of Hindustani. 

(a) Hindustani is closely allied to Japanese and Korean in its word 
order and the use of auxiliary particles. The pitch and stress vary only 
slightly in the language. 

In polysyllabic words uttered in isolation, the pitch continues to rise 
gently up to the point of maximum stress on the final syllable, and then 
begins to fall simultaneously with the decrease in stress. It is the same 
when the word occurs in connected speech except that in this ease the pitch 
remains high, under the influence of the subsequent word, up to the close 
of the final syllable. Examples : [subah], [badge:], [uthkar], [maijarseij, 
[gajai] in No. I ; [doipaher], [takliif], [hatam], [khamai], [thorai], [dgairei], 
[leikin], in the remainder of the sentences. 

(b) Words that end sentences, such as fgajai] ,[a:ja], [kh], [dgai-uinga:], 
[khai-uingai], [hum], [hai] etc., have a terminal fall just as when they are 
uttered as separate words. 

(c) The rising pitch of[dga:pa:ni:], [aibohawai], fkaisk], [maalium], etc. 
that occur in the interrogative sentence No. 8, seems to be a little 
greater than in the descriptive or the narrative sentences, [hain] that 
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comes at the end of the interrogative sentence in No. 6 has a very slight 
rise, whereas in No. 8 the final [hai] shows a falling tone. The rea- 
son is this: No. 6 begins with an interrogative word [kja:] while in 
No. 8 the corresponding word is in the middle of the sentence. As in 
other languages, interrogative sentences without any interrogative 
word have normally a rise when final. 

(d) Although in Hindustani there is a tendency towards a rising into- 
nation, the stress in the language varies almost from word to word. 
Hence the question arises as to whether the language belongs to the 
pitch accent or stress accent type. 

In sentence No. 9, the same word [rjeitii] occurs twice, first at the end 
of the initial clause and then as the last word but one of the final clause. 
In the former case, the final syllable is slightly lower than the preceding 
and has a terminal fall, but in the latter, the final syllable is higher. The 
stress is spread over syllables similarly in both cases, the way of distribut- 
ing stress being more or less fixed in the language. In the emphatic No. 
10 and No. 11, [je] and fme:ri:] are emphasized by attaching [hi] and 
[ hi: ] respectively. These two and the descriptive No. 12 have the same 
word [ kitmb ] in which the pitch varies according to the sentence but the 
stress follows the same fashion of distribution. Especially in No. 11, 
despite the fact that [ hai J is extremely weak, the preceding [-taib ] 
retains some relative stress as against f ki ]. From these observations, 
Hindustani may be said to be a language with stress accent, and a tenden- 
cy towards a rising intonation. 

III. 10. Russian. 

The two phrases and ten sentences given below as pronounced by Mr. 
Wanovsky have been selected as material for Russian. 

1. na prava. (to the right.) 

2. na tjajku tjaju. (to tea) 

3. zvonka i dal'ako razdajotsa tjiiavetjeska gotas fpustat’^. 

(In the hollow a man’s voice resounds loudly and afar.) 

4. moj drat-pr’il’esna mal’tjik. (My brother is a diligait person.) 

5. prava ana xaro/in’kaja. (She is really beautiful.) 

6.ia xarajo alfedai. (I have eaten enough.) 

7. ]a xarajo ab]exal (I had a good excursion.) 
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8.^ xarajo ? (Is it good?) 

da, ^3 xarojo. ( Yes, it is good.) 

10. ivin ivanovitj dotna ? (Is Ivan Ivanovitch at home?) 
da, doma. (Yes, he is.) 
n’et n’idoma. (No, he is not at home.) 

These sentences given above are not written in ordinary Russian ortho- 
graphy, but in phonetic transcription of standard Moscow speech. The 
sign ['] on the upper; right-hand comer of each phonetic symbol and 
over each vowel stands respectively for soft sign and accent mark, 

Of these phrases and sentences, the two phrases Nos. 1 and 2 and the 
sentence No. 6 that is analogous with No. 7 will be omitted. With regard 
to the remaining nine sentences, the relations between pitch and stress 
may be represented in curves as in the diagrams on the following pages. 

(a) In Russian, the pitch and stress vary to a remarkable extent. The 
rise takes place in the first half and the fall in the latter half of a phrase 
or short sentence in description. In the long sentence No. 3, the rise and 
fall in pitch are sharpest initially and become less so toward the end, 
the lowering of tone accompanying this change, 

(b) The “accented vowel” is usually stronger than the “unaccented 
vowel”, but sometimes there is observed but little difference in stress. 
For example, in [ razdajotsa ] the first three syllables are produced with 
much the same degree of stress, but the last syllable alone bears extremely 
weak stress. Again in [t/ilivetjeska] the first three syllables are similar- 
ly stressed, the final f -ka ] is slightly weaker and [ -tjes-] becomes voice- 
less. Speaking generally, the “accented syllable" is longer in duration 
than the “unaccented.” 

(c) The “accented syllable” is not necessarily uttered with greater 
stress than the “unaccented”. But the rise invariably occurs on the “ac- 
cented syllable” of a word within a rising or level sequence in the sen- 
tence. Examples ; [zvdnkaj, [dal'ako], [razdajotsa j, [t/ilavetjeska], f golas) 
in No. 3; f xarajo] (7) ; [ ivan 1 (10), etc. 

There is a drop in pitch, however, on the “accented syllable” of a word 
within a range of falling tone in the sentence. Examples : [ prava ] (1) ; 
[tjaju ] (2) ; [ mal’tjik ] (4) ; [ xarojm'kaja J (5) ; [ ivanovit/ J (10), etc. 
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One of the characteristics of the accented syllable with a level or 
falling sequence m the sentence may be a rapid rise which, followed by 
a level tone, ends with an equally rapid fall and a gradual increase in 
stress. Examples: fpr’il’cgna] (4) ,[3b’ed9iJ (6) ;robjex9i] (7), etc. 

(d) The interrogative is nearly akin to the descriptive sentence in 
this respect. Only on the “accented vowel” of the word with an inter- 
rogatory meaning, the pitch ascends to a remarkably high pitch as in 
[xarajo] (8) .and[d6ma] (10). 

(e) Although there is a rise in pitch on the “accented vowel”, the fix- 
ed element of the word is not the pitch but the stress. In sentences Nos. 
7, 8 and 9, the same word [ xsrajo ] functions differently according to the 
circumstances under which it is uttered. The stress undergoes but little 
alteration, although the pitch takes a special form according to the sen- 
tence. Similarly, [domo] is used in the interrogative No. 10, afiSrmative 
No. 11 and negative No. 12. And like [xarajo] it ivorks differently ac- 
cording to the context. In No. 12, however, it decreases in stress on the 
second syllable so that the negative meaning is made clear. It may be 
known from this that the stress also varies to some extent so long as 
the accent remains unaffected. And tliough the order of stress between 
syllables undergoes no alteration, the ratio of stress varies within a con- 
siderably wide range. 


III. 11. Mongolian. 

The following sentences as pronounced by Mr. Sheh Yun-ch'ing have 
been employed as material for Mongolian. 

1. tere xolda-nai hun maratur mnkuden du ecine. 

(The merchant will go to Mukden to-morrow.) 

2. ene roil bolbalcin tere daba-nai zun-teigus rarci baina. 

(This river starts from the eastward of that mountain.) 

3. ta jiben hile helci uidanu '? 

(Can you speak Japanese?) 

4 ta do: doilxot dortai jui ? 

(Do you like to sing a song?) 

5. tere marraji irenu: ? 

(Will he come to-morrow?) 
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6- tere marraji iren-baxa 
(He will come to-morrow.) 

7. noxai turcizi. 

(A baby dog is born.) 

8. ene noxai bul jehe. 

(This dog is big.) 

9. surhei noxaL 

(A wonderful dog.) 

10. minu dega bolbasu masi ba/ntur ere bolai. 

(My brother is a very brave fellow.) 

These are representations of the Mongolian sounds by means of phone- 
tic symbols. It cannot be assumed, however, that the attempt succeeds 
in representing Mongolian speech with perfect correctness (the fact is 
that Mongolian pronunciation has not yet been thoroughly studied from 
the phonetic point of view and the pronunciation of a large number of 
words varies according as the words are in isolation or in compounds.) 

The pitch and stress curves of No. 10 are given in Fig. 41 and for the 
rest of the sentences they are given in Fig. 40. 

(a) In spite of the close relation between Mongolia and China, these 
being neighbouring countries geographically, their languages differ great- 
ly in phonemes, word formation, phraseology, etc. In phraseology, 
Mongolian resembles Japanese rather than Chinese, in many respects. 
It has two different styles of speech, viz. the formal or literary and the 
colloquial or conversational styles. The pronunciation and accent 
slightly differ in these two cases. The material used in this book, how- 
ever, is from the colloquial style. 

(b) As will be seen from the diagrams mentioned op the following 
pages, the pitch varies very widely and correspondingly to the stress. 
Most words have the “peak of accent” on the final syllable, as is in- 
stanced in [minu], [masi], [bolbasu], [degu], [baratur], [ere], [bolai], 
etc. in sentence No. 10; see Fig. 41-5. In description, however, the end 
of the sentence is slightly high-pitched. 

(c) Mongolian is an agglutinative language, excellent examples being 
[xolda-nai] (1), [daba-nai] (2), [znn-teigus] (2), [^-arui-baina] (2), 
[cida-nu] (3), etc. As in Japanese, the agglutinative form becomes a 
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Fig. 40—1. 
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Fig. 40 — 8. 





Fig. 40 — 4. 

new accentual unit. 

(d) The agglutinative compound is considered either as one unit or 
two units of accent according to the length. When treated as two 
units, the original accent of each component is retained in the compound 
form. But if treated as one unit, the accent of each element undergoes 
modification. The accent of [ire] in [iren-bara] (6) becomes altered when 
[ire] is carried into [irenu] in No. 7. ( [n] that occurs in [iren] in No. 

6 has been attached for the sake of euphony, the root of the word being 
[ire] ) 
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(e) In questions, the interrogative words [nu] (see Nos. 3 and 5) and 
D'u] (see No. 4) are used. As has previously been described, the final 
syllable has a high pitch under the influence of accent in Mongolian with 
the result that in descriptive sentences too, the ending is high-pitched 
and the pitch is of mountain shape if represented graphically, just as in 
the interrogative. Only in the latter case, the rise in pitch is more or 
less'abrupt. 

(f ) It has been pointed out that the final syllable at the “peak of accent” 
is higher in pitch than the preceding syllable. The degree of pitch, how- 
ever, is not definitely fixed but varies according to the circumstances. For 
example, [noxai] is most prominent from the standpoint of accent in No. 
7 and is less so in No. 9. Again in No. 8, the difference in pitch is almost 
negligible except that [no-] is uttered on a falling tone and is shorter 
than [-xai] which has a rising tone. That in each case a similar impres- 
sion is produced, though more or less different in degree, is due to the 
common rise and fall in pitch. It should not be assumed from this, how- 
ever, that the accentual peculianties are lost sight of. In brief, we may 
say that the stress always varies proportionally with the pitch. 

(g) Accent m Mongolian may be classified into several types according 
to the records themselves mentioned in the diagrams. Most of the words 
have the “peak of accent” on the final syllable. Thus in trisyllabic words 
the first two syllables are pronounced alike with level pitch, and the last 
syllable is higher still and its pitch-curve is of mountain shape. Ex- 
amples: [bolbalcm] (2), [ciclanu] (3), [dortai in:] (4), [turiiizi] (7), etc. 
In [maj-atur] (3) and [baj-auir] (10), the second syllable is slightly high- 
er and the corresponding syllable is specially high m [marraji]. The 
pitch remains nearly constant and only shows a slight terminal rise in 
[xolda-nai] (1) and [daba-nai] (2). 

Furthermore, in some dissyllables, the first syllable is on a level or 
falling tone and the second is high in pitch, its pitch-curve taking a moun- 
tain shape. Examples: [mukden] (1), [baxa] (6), [noxai] (7, 8, 9), 
[jehe] (8), [surhei] (9),[degu] (10), [masi] (10), etc. As is instanced 
in [jiben] (3), [minu] (10) and [ere] (10), other dissyllables have a 
rising pitch on the initial syllable. In [tere] in Nos. 1, 5 and 6, and [ene] 
in Nos. 2 and 8, the initial syllable is uttered on a rising tone and the final 
on a falling tone. The initial syllable is stressed m [ene]. 
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(h) From the records given above, Mongolian may be called a language 
with “pitch and stress accent”, since the pitch always varies proportional- 
ly with the stress. A more detailed study, however, shows that the pitch 
rather than the stress is more characteristic of the language. This is why 
I have classified the accent according to pitch. That the stress does not 
necessarily remain constant will be noted when one hears Mongolian 
spoken in a low voice ; viz. whereas the pitch invariably rises in terminal 
positions, the stress falls as often as not on the initial rather than on the 
final syllable. (All Hindustani words have a terminal rise, but there is 
in no way a close connection between the pitch and stress. Immutability 
is observed in the stress which is characteristic of the language.) From 
this, Mongolian can be said to be a language with “pitch accent”. 

Japanese and Mongolian have much in common in phraseology and in 
qualities of accent, but they differ entirely in types of accent. In tonal 
impressions, Mongolian is very much like Chinese, its syllables being 
long and giving an impression of much composure. 
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CHAPTER IV 

THE SCOPE OF ACCENT 
IV. 1. The Origin and Nature of Word Accent. 

"However correctly the sound itself may be produced, the stress and pitch 
elements cannot pass unnoticed. If the same word is uttered with a high 
pitch on thejpitial syllable by some, and on the final syllable by others, 
a great 3^ of inconvenience will be experienced, since the language is 
essentially intended for purposes of communication. Accordingly, within 
one community in which the same language is spoken, some sort of uni- 
formity has been reached with regard to pitch and stress and they have 
gradually been fixed in certain forms. Neither pitch nor stress, how- 
ever, need always be fixed at the same time, for if either is fixed, words 
can be differentiated from one another. Hence the difference between 
pitch accent and stress accent. 

The accent occurs in any language even when there is no difference in 
pitch and stress between syllables and neither rise nor fall takes place (in 
this case the term "level accent” may be used). In consequence, to say 
that the word has no accent means that it is devoid of any peculiarities in 
tone at all, and that as far as pitch and stress are concerned, whatever 
pronunciation is used should be accepted as right and good. It should be 
remembered, however, that some words have no accent at all by nature, 
the Japanese interjection sff being an excellent example. We are quite 
unable to recognize in the word any definite form of accent, since vari- 
ous shades of meaning can be expressed by changing the degree of 
pitch and stress. 

IV. 2, The Definition and Scope of Accent. 

The apportionment of pitch, stress and intonation among syllables of 
a word is usually termed “syllable accent”. This is undoubtedly because 
the accent stands for the “emphasized sound". Since, however, in our 
definition of “accent”, we are concerned with the form of a whole word, 
as viewed from the standpoint of tone, and we are not concerned with the 
stress on one syllable, only the term "word accent” should be preferred. 

(“Word accent” will hereafter be understood in the sense of what is 
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usuallp termed “syllable accent”.) 

In like manner, an attempt will be made to learn whether accent is 
possible in a combination of more than two words, or in a complete sen- 
tence. Furthermore, if “accent” of this kind is at all possible, a study 
will be made of its nature. 

IV. 3. Collocation Accent. 

When two or more than two words are combined to express a certain 
definite meaning, and the combined form is used repeat^ly, -we shall 
have a fixed form of accent on the combination. For example, in Japan- 
ese riko no, kodomo or clever child and akai hana or red flower, certain 
fixed pitch relations are observed between the first and second elements 
of these combinations. Namely, we have pitch accent in combining the 
adjective with the noun. This may be termed “collocation accent”. It 
has previously been illustrated by the example of kanaga kireida that 
accent in a collocation is created where the noun and the auxiliary 
particle are combined. In English, too, when adjectives are used for 
contrast, we have a similar relation between adjectives and nouns, of 
which the former are generally higher and stronger than the latter. 
Furthermore, a kind of “collocation accent” is often created in the com- 
bination of articles and nouns. 

The fusion in that mode of combination of more than two words which 
has been treated above, seems to be less intimate than in the case of two 
words together forming what is quite a new word with a single new idea. 
For example, Japanese kado or gate and inatsu or pine together form 
kadomatsu or New Year’s pine-tree decoration, and me or eye and ha- 
na or nose mehana in the composite verb mehanaga tsukni or to get 
in shape. Such composites denote a single new idea rather than the 
combination of two ideas suggested by the component elements, and this 
leads in many cases to a different form of accent from that of the accent 
of the original words. The ordinary compound can be distinguished in 
this respect from the collocation. English greenhouse, bluebottle, good- 
natured, iUrtempered, harelip, etc. are, among others, good examples of 
ordinary compounds, which are found in several parts of speech. 

The pitch-stress relation between the adjective and the noun, the noun 
and the auxiliary particle, or the article and the noun in Japanese and 



THE SCOPE OF ACCENT 


91 


English have already been treated with in the preceding paragraphs. 
With a view to studying accent in collocation m other languages and 
of the variant forms of accent in these different languages, a sentence to 
the effect that “my brother is a very diligent person," is put into English, 
German, Dutch, French, Italian, Spanish, Japanese, Korean, Mongolian, 
Chinese and Malay as in the following. The records have been made of 
the resulting sentences pronounced in standard pronunciation of their 
respective languages. In the Italian sentence, “youngetr brother” has 
been replaced byi “younger sister”, and in German, Dutch, Italian, 
Spanish, Mongolian and Chinese, “brave" has been substituted for 
“diligent”. 

As they belong to the same family, English, German and Dutch have 
much in common in the structure of sentences, as have the Romance lan- 
guages, French, Italian and Spanish, for the same reason. Of Oriental 
languages, Japanese, Korean and Mongolian are grouped under the same 
category, but Chinese and Malay differ from any of the above-mentioned. 

1. My younger brother is a very dthgent person (English) . 

2. Mem jwngerer Bruder ist em sehr muUger Junge. (German) . 

3. Mi 7 n iongere broer ts een seer dappere jongen (Dutch) . 

4. Mon frire cadet est un gargon tree applique (French) . 

S Mia sorella pid piccola e una ragazsa molto brava. (Itahan) . 

6. Ml hermano menor es un muchacho muy valiente. (Spanish) . 

7. Watakushi no otdto wa hijo ni Jdnben na otoko desu. (Japanese) . 

8. naft aa nin pok kun mrun han sa lam io. (Korean). 

9. minu degu bolbasu masi bayatur ere bolai (Mongohan) 

10 uo® tP giuo ti' i* ioo’kan’nan® ts'. (Chinese) 

11. adik saja seorang anak jang berani (Malay) . 

The relation of pitch to stress in these eleven sentences is represented 
in curves in Fig, 41. The English younger brother, German jungerer 
Bruder, and Dutch jongere broer, though all are of the same linguistic 
family, have different forms of relative pitch and stress of the adjective 
and the noun. For instance, in English, the former is stronger and 
higher, in German the two have the same degree of stress, the latter, 
however, being higher in pitch. Again, in Dutch, both are uttered 
with the same degree of pitch, but the latter excels in stress. 
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Fig. 41—1. 
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rig. 41—4. 
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Fig. 41—6. 







98 


A STUDY OP ACCENT 



When the adjective is preceded by the noun as in French, Italian and 
Spanish, the former is stronger and higher in Spanish, weaker in French 
and lower-pitched in Italian. The relation between the adjective and noun 
in diligent person or brave person that is in the terminal position in 
the sentence, is different from that between the adjective and noun in 
the initial younger brother. In English, the pitch and stress occur in 
the same order both in the initial younger brother and the final diligent 
person, but in German Bruder is higher than jungerer and Junge is 
lower, though stronger, than mutiger. The adjective is lower and 
weaker in Dutch. The reverse is the case with Spanish, but the difference 
in pitch and stress is here relatively small. And in French and Italian, 
the regular order is on the whole comparatively well maintained. Dili- 
gent and person are pronounced with much the same degree of pitch 
and stress in Japanese, Korean, Mongolian, Chinese and Malay. 

Whereas in Dutch mijn is the lowest and weakest element in the sen- 
tence, the corresponding word in Mongolian is the highest and the strong- 
est. In English, Japanese, Chinese and Malay the pitch as well as the 
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stress is medium. The stress is also of medium degree in German and 
Sp’anish, but the pitch is slightly below the average. In French and 
Korean, on the other hand, the pitch is medium with greater degree of 
stress. The remaining words such as the verb, article and auxiliary 
particle, etc. have their respective forms of pitch and stress. All this 
will be summed up in tabular form on the next page. 

The Hindustani sentence of the same meaning has not been recorded. 
But in this language, the difference in pitch and stress between the adjec- 
tive and noun in the combination such as[Ia:kapra] (red cloth) and [atjtja: 
Iarka:](good boy) is almost negligible, except that the rising interval of 
the adjective appears to be a little greater than that of the noun. Normal- 
ly, the conjunction such as [am] (then),[le:kin] (but) , etc. is lower than 
any other word in the sentence. 

In this way, not only in the languages of different families and systems 
but also in those of the same family and system, there are wide diver- 
gences from language to language in pitch and stress of the corresponding 
word in the sentence of the same import and phraseology. The reason is 
perhaps this: the distribution of weight over words is influenced not so 
much by the purport and contents of the sentence as by the force of 
custom. It should be borne in mind, however, that the weight given to 
individual words undergoes modification by the tone of the sentence as a 
composite whole. 

The signs O, A, x given on the next page indicating a predominant 
syllable in each word, stand respectively for high, medium and low pitch, 
and strong, medium and weak stress. The pitch is given on the left and 
the stress on the right in the same column. 

IV. 4. Sentence Accent. 

With a view to the research into the existence and nature of sentence 
accent, speech-dimensions should first be considered, and further observa- 
tions may be facilitated by defining in the most appropriate words various 
terms that are necessary in the following sections. 

Preliminary to discussing the substance of the subject, definitions will 
be given as to two technical terms that will be used hereafter. These 
0JB (i) Expression : by Expression is meant the pitch, stress and speed 
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•• n.p. Nonunatlve Particle. 

of the voice, with which a certain sentence is to be uttered. 


(ii) Pattern'. When two sentences have the same parts of speech 
arranged in the same order, syntax-pattern is said to be the same. 

Speech-dimensions are three in number. 

(a) First, those sentences which are the same in form, i.e. with the 
same pattern, and which express the same “abstract contents”, though 
formed by different words, can be represented by a straight line stretch- 
ing in thei direction of “0 — A.” (By "abstract contents” are meant 
thoughts and emotions.) See, Pig. 42. given on the next page. 

Examples : 

This is his book. 


































THE SCOPE OP ACCENT 101 

That is her pet. 

They are my children. 

Thffle word-groups are of the same pattern, and at the same time they 
are said to have the same tone of speech, i.a sentence accent of the same 
type, phonetically speaking. 

By sentence accent is meant the tone common to those word^-groups 
which express, with the same syntax-pattern, the same “abstract con- 
tents"; in other words, a definite combination of pitch, stress and varia- 
tion of speed within connected speech. (Sentence accent is subject to 
variation to a greater or less degree according as the accent of the com- 
ponent words varies.) In this way, the definition of sentence accent 
becomes identical with that of word accent in the sense that accent is 
a tone peculiar to a given word, phrase or sentence, by which the signi- 
ficance and contents of the word, phrase or sentence are made known to 
the hearer more directly and more precisely. 


A 



OA — same contents. 

(same patterns.) 

OB— ^ame contents. 

(different patterns.) 

OC different contents, 
rdifferent patterns. 

I same patterns 


Fig. 42. 
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It very often happens that dialects are hardly intelligible when spoken, 
though they are fairly intelligible when written. This is chiefly because 
we are little accustomed to the tone of speech, i.e. sentence accent, typical 
of the dialect of a certain district. Rven those m districts where a dialect 
is used, who have been carefully taught to read standard Japanese, may 
find it difficult to understand standard speech when it is spoken by edu- 
cated Tokyo people. It is perhaps for the same reason that we can only 
with difficulty follow foreigners speaking their native tongues, while we 
can fully understand the corresponding languages spoken by Japanese. 

Speaking generally, those word-groups which have a common syntax- 
pattern and expression, viz. which are on the same straight line in the 
direction of “O— A” in the diagram of speech-dimensions are uttered with 
sentence accent of the same type. Accordingly, the corresponding con- 
stituent units of these word-groups, viz. those words which belong to the 
same part of speech, occupy the corresponding position and have pitch and 
stress of the same degree in the sentences consisting of equivalent word- 
groups. When these constituent units are repeatedly used, combined with 
the preceding and subsequent words, the forms of pitch and stress in them 
tend to become fixed, and we have then what may be called “collocation 
accent.” In the case of conjunctions, we have no definite part of speech 
to combine with them. They are the words that are usually lower and 
weaker than those which precede or follow them, and in this instance, 
too, a kind of “collocation accent” occurs. Where sentence accent and 
“collocation accent” conflict, the former usually prevails. For example, 
in description or narration, the fall generally takes place at the end of 
the sentence ; but in Dutch and Spanish in the list of the eleven languages 
dealt with in the previous section, when a combination of a noun and 
adjective with “collocation accent” comes at the end of a sentence, the 
noun IS weaker and lower than the adjective where the contrary usually 
prevails. 

In German, the “collocation accent” is sometimes so flexible as to make 
up for the fall in pitch by the increase in stress. In fine, although it goes 
without saying that different languages have different degrees of im- 
mutability in accent, accent can be arranged, according to the degree of 
immutability, into (1) word accent, (2) sentence accent and (3) 
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phrase accent. 

‘ (b) Secondly, those sentences which have the same abstract con- 
tents, though their syntax-pattern is different, can be arranged on the 
same straight line as “O — B” direction in the diagram of speech- 
dimensions. See Fig. 42. For instance, two sentences, This is his book 
Snd This bosk is his, no doubt give expression to the same concept. 
In addition to those, can be given the following sentence with the same 
abstract contents: This desk and that chair have been given by him 
to his brother. Though the sentence Cheiry blossoms are beautiful 
is analogous with those given above in so far as it is descriptive, all 
these sentences express different kinds of feeling, and cannot be said 
td express the same abstract ideas. 

(c) Thirdly, those sentences which differ in abstract contents are in- 
dicated by “O — C” direction in the diagram of speech-dimensions; see. 
Fig. 42. 

Difficult as it is to name in a sweeping manner all kinds of such con- 
tents, they are broadly divisible into: (1) descriptive; (2) emphatic; 
(3) interrogative; (4) imperative; (5) declamatory, etc. Sometimes 
the sentence falls strictly under one of the above heads, or sometimes 
comprises more than two, a delicate difference in tone being observed 
between the two instances. 

In order to give expression to different contents, different syntactical 
patterns are adopted, to each of which is attached such a tone of speech, 
i.e. sentence accent, as is most suitable for the contents. 

As to how sentence accent varies with different contents, no thorough 
investigations seem to have been made. It has only been pointed out so 
far that the end of a descriptive sentence is generally uttered on a 
falling tone and that the pitch rises finally in a question that requires 
the answer “yes” or “no”. 

One and the same sentence has occasionally different shades of meaning 
according to different modes of expression, i.e. intonation. For instance, 
sa is differently uttered by changing the pitch, if we leave stress in the 
background for the moment. It may be said with a sudden fall, sudden 
rise, level tone, rise, fall, rise-fall, fall-rise, or rise-fall-rise. Such devices 
for producing the same sentence with different forms of intonation are 
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widely resorted to, especially in the English language. According to Mr. 
Palmer, twelve different meanings are possible for the sentence I don’t 
know, and sixteen for Horses are strong. In many cases, we 'quite 
unconsciously make constant use of the device of expressing the same sen- 
tence differently. Closer inspection will surely prove that such is more 
widely resorted to than we imagine. All this is a phenomenon spnAfg 
from delicate mental activity. In this connection, a word may be said 
on the change of relative pitch and stress, as effected by emphasis. 

As is shown in Fig. 43 given on the next page, special emphasis is laid 
on that in That man is strong in (A), and man is emphasised in (B) to 
distinguish it from woman, boy, etc. In (C) and (D), ts and strong are 
emphasised respectively to call attention to the fact of being strong. 

In (A), as is illustrated in (ii) in the section of sound psychology, a 
sudden rise in pitch and a sudden decrease in stress occur on that. 

That in (B) and (C) have an analogy to that in (A) in the rise of 
pitch, but differ from the latter in the relation of pitch to stress, belong- 
ing to the category of (iii) or (iv) also given m the section above- 
mentioned. In short, that in (A) gives an impression of prominence. 
And in this case, that is stronger than man under the influence not of 
emphasis, but of “collocation accent’’. (This will clearly be understood 
if you compare them with That and tnan in (D).) Man is slightly 
stronger in (B) than m (A) and is uttered with much the same degree 
of stress as the preceding that, but is lower and much longer than the 
last-named. 

In (C), is is the highest and strongest, which shows that the word 
doesnot form any “collocation accent" in combination with maw or strong. 

The first three words, i e. that, man, and is, in (D) make for mono- 
tony in pitch : and strong is slightly stressed at the end of the sentence. 

From what has been said, we see that these four types of emphasis 
have been effected in quite different manners. This is because the 
effect of emphasis has been assorted according to the extent of devia- 
tion from the commonest standard expression, on condition that 
“collocation accent” be preserved. “Collocation accent” in that man 
is attained by pitch and stress in (A) and by pitch alone in (B). 
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» SO 40 M M 70 M 99 


Pig. 43. 

Thus, one and the same sentence may be expressed diiferently by differ- 
ent intonations. It should be noted, however, that intonation is not an 
arbitrary thing. As noticed in the previous examples, intonation is sub- 
ject to control, as it were, by phonetic agreements or usage made enforce- 
able within the scope of each language. Supposing that Horses are 
strong can be expressed in sixteen different ways, all the sentences with 
the same word order will likewise be furnished with sixteen modes of 
expression. In other words, there will be sixteen definite types of expres- 
sion common to all of them, from which will result sixteen forms of 
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sentence accent. These observations may lead us to think that there 
are innumerable forms of sentence accent, but this is not really so. Those 
forms of sentence accent which are in common use are comparatively fevir 
in number. For not only short sentences with the same abstract contents 
and the same syntax-pattern have sentence accent of the same type, but 
also those with different modes of word order. In longer sentences, 
too, some types of sentence accent are only extended forms of what occur 
in shorter ones, while others are combinations of different types of them, 
with the result that these can be reduced to a relatively few fundamental 
types. When one and the same sentence has different shades of meaning 
according to its intonation, more than twelve or sixteen forms of sentence 
accent are attached to the one single sentence. We have then one “funda- 
mental tune” or “tune in commonest use” on the one hand, and a large 
number of “off-shoot tunes” on the other. Certain principles seem to be 
at work when a great many “ off-shoot tunes” spring from the “funda- 
mental tune”. For instance, in That man is strong, that men bears 
“collocation accent” so that man cannot be uttered with a higher pitch 
and a stronger stress than that, even though emphasis is required on 
man. We escape from the dilemma, however, by lengthening the sounds 
in man. In this way, although there are quite a number of “oflf-shoot 
tunes”, they can after all be formulated by the combination of a few 
“rules”. Similarly, whereas Chinese monosyllables have only four types 
of accent, the number of the types of accent amounts respectively to six- 
teen and four-and-sixty in dissyllables and trisyllables. In fine, it may be 
asserted that Chinese accent can be reduced to four fundamental types. 
“Tunes” and “rules” are not entirely different things, and may be regarded 
as identical from a certain point of view. In the present study, however, 
they will be considered, for convenience, as independent. In English, the 
“fundamental tunes” are formed by “types”, and the “oflf-shoot tunes” by 
“rules". Nearly all types of sentence accent are created by “rules” in 
French and by “types” in Russian. But some languages such as Hindus- 
tani are devoid of either “types” or "rules”. As has been explained, a 
good many forms of sentence accent are attached to one single sentence 
in English, but in French it is customary to express different contents by 
different modes of syntax-pattern without resorting to the device of “ex- 
pressing the same sentence differently”. In this respect, Japanese seems 
to resemble French, which may explain in some small measure, the relative 
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facility with which Japanese learn to pronounce French as compared with 
English. 

Needless to say, in any language, expression of sentences is not at all 
free from the influence of individuality or of personal mood and manners, 
even if it still remains essentially identical. From this point of view, it 
ntay be said that there is a certain sphere of allowance for individual dif- 
ferences and variations. 

Strange impressions are made upon the mind of hearers when the limit 
of allowance in standard sentence accent is exceeded. Such anomalies 
often tend to become fixed in the speech of individuals. We have then the 
so-called "provincial accent” (includmg of course word accent). This 
“provincial accent” occurs when people speak in languages other than their 
mother-tongue, or in the speech of other districts to which they are little 
accustomed: e.g. English with a (Jerman accent; Tokyo speech with a 
Kyoto accent, etc. When Germans speak in German or people in Kyoto in 
a Kyoto accent, they do so with the accent typical of the country or locality 
so that there is no "provincial accent” here. Within one country, how- 
ever, such provincialism is called dialect, as distinct from a standard 
form of speech. Speaking generally, "provincial accent” includes not 
only the "tone-element” but also the "phoneme-element”. 

IV. 5. Accent of French. 

The distribution of stress over French syllables varies only very 
slightly with different parts of speech, but wide divergencies are ob- 
served in pitch in similar circumstances. For example, less important 
words in a sentence such as articles and conjunctions are normally 
lower in pitch than the other words of the sentence. 

When a French word is uttered in isolation, the pitch rises on the final 
syllable with the simultaneous increase in stress — one of the characteristic 
features of the French language. In connected speech, on the other hand, 
the last syllable of a word that forms a rising sequence of notes in the sen- 
tence is said with a high pitch but becomes slightly weaker ; and when the 
word is uttered as a falling sequence, the corresponding syllable is low in 
pitch but strong in stress. In both cases, the final syllables tend to be 
pronounced long. Whether said in isolation or in connected speech, 
the final syllable gives greater psychological Impressions than the pre- 
ceding ones, viz. it has greater sonority — a phenomenon common to 
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both instances. From this, and in view of the fact that pitch and stress 
lack immutability, French accent may be termed “sonority accent”. 
In German, flexibility such as noted in pitch and stress is a phenomenon 
that accrues from the clash of “collocation accent” with “sentence 
accent.” In French, on the other hand, word accent changes according 
to sentence accent. 

French, in which there is no remarkable immutability in word accent, 
and in which sentence accent is dominant, contrasts strikingly with Eng- 
lish with its many and varied forms of sentence accent. Just as English 
“collocation accent” involves both pitch and stress elements, so also French 
word accent has the two elements. It is, therefore, a matter of con- 
siderable difiiculty to determine once and for all whether French is a 
language of pitch accent or of stress accent. If there should be any 
need to choose among terms other than “sonority accent” given above, 
the term “flexible stress-pitch accent” would be preferable. 

IV. 6. Time In Accent. 

(a) The objective sound attributes that give impressions of tone to 
hearers are pitch, stress and length. When the pitch is more or less fixed 
we have pitch accent, and in like manner we have stress accent when the 
stress is determinate. The question is whether or not the same is the case 
with length. 

Needless to say the human voice has no absolute pitch and stress. 
However, just as in speech of the same style we have medium pitch and 
stress in the respective cases of the voices of men, women, old men, and 
children, so also we have medium speed in speech. Any speech outside 
this medium or standard speech, either faster or slower, cannot be said to 
be normal. 

If we play a gramophone record of a lecture delivered in a grave tone 
and increase the speed of the disc to 1.3 times the regular speed, the tone 
becomes frivolous. Within this limit, however, it will not sound un- 
natural. But if the speed is further increased to over 1.7 times, the 
speech as a whole will come to sound quite unnatural, even though it will 
still be understood distinctly. To increase the rotary speed of gramo- 
phone records means to lift up the pitch of the voice, to shorten the dura- 
tion of speech and intervals between words and to make abrupt the change 
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in pitch. All this certainly exercises a great influence upon ttie'^ne of 
speech. 

Prom this it may be noted that length of sound is no absolute thing and 
is capable of change according to tone and intonation and that when we 
describe vowels as being long or short, we do not mean that there is any 
de’fmite length to qualify them as long or short. But subjectively the long 
vowel is always clearly distinguished from the short one so that when these 
sounds occur in connected speech they can be of any length. For instance, 
the subjective discrimination between long and short vowels is present 
even where a long vowel in rapid speech is shorter than a short vov/el in 
slower speech. 

(b) The vowel in a monosyllable is sometimes pronounced rather long, 
but the short vowel contained in a polysyllablic word is seldom pronounced 
long inasmuch as it has to keep up relative length as against other vowels 
in the same word. For example, the word hook is sometimes pronounced 
long, but hook in the compound bookkeeper is usually short. Further- 
more, monosyllables in languages of stress accent are sometimes stronger 
and sometimes weaker than the preceding and subsequent words, but in 
words of more than two syllables, the stress is relatively fixed as between 
component syllables. There seems to be, in my opinion, a more or less 
close correspondence between the two phenomena. • 

Accent may further be considered from the standpoint of the time ele- 
ment of tone. However, since the accent of this type is common to almost 
all languages and the length of vowels can be compared more favourably 
with the kind of phonemes, there seems to be no special need for the term 
"length accent”. Only it should be borne in mind that the subjective form 
of words includes as important factors, not only stress and pitch but also 
length. 

(c) Inasmuch as the long vowel is more sonorous than the short one, 
the “betonte Silbe” is longer than the “unbetonte Silbe” in French, a 
language of “sonority accent”, and in Russian, a language of “stress 
accent”. This is also the case with some German words such as jeder, 
liebte, uber, htigel, etc., in all of which the vowel in the “betonte Silbe” 
is close. It may be concluded that after all length has only a supple- 
mentary significance in languages of “sonority accent”. 
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IV. 7. Accent Peculiarities of Various Languages. 

In the foregoing, I have dasaified the accents of various languages into 
the three kinds of pitch, stress and sonority accent according to the char- 
acteristics of pitch and stress in speech-sound. The classification is made 
chiefly on the basis of immutability of the three sound attributes. . '.Of 
course, it is admitted that those classed as languages of "pitch accent” or 
of “stress accent” differ greatly within the group, hence any hasty conclu- 
sion should better be avoided. For example, despite the fact that the 
stress is determinate, the “accented syllable” in Russian is made pro- 
minent by means of pitch. 

Below I make a general survey of the characteristic features of accent 
in various languages so as to facilitate understanding. 

Pitch accent: 

1. Chinese. ^ 

2. Japanese. 

3. Korean. 

4. Mongolian. 

Stress accent: 

6. Russian. 

6. German. 

7. English. 

8. Hindustani. 

Sonority accent : 

9. French. 

1. The accent of Chinese has a most distinctive raison o’/tre. In 
Chinese, the pitch takes four forms, i.e. level tone or Joq p'ig, sudden 
rising tone or ^ia p'ig level-rising tone or Jap Jau, and falling tone 
or t'gy J3Q, by means of which the monosyllabic words are distinguished 
from one another. 

2. Japanese is also a language of “pitch accent”. Its pitch accent is 
comparatively well fixed, which is characteristic of its word accent as well 
as its collocation accent Maybe, all this is due to the structure and 
phraseology typical of the language. 

S. Although Korean pnma facie resembles Japanese, further detailed 
studies will be needed before we can arrive at any definite conclusion as 
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regards the subject. In Korean, too, the stress varies but slightly between 
neighbouring syllables, but there may be observed a considerable differ- 
ence in stress throughout the sentence. 

4. In Mongolian, the pitch changes proportionately with the stress; 
and the final syllable is generally sirong and high. And the degree of 
pittih and stress largely depends upon the position of words in the sen- 
tence. Taking this into account, Mongolian cannot instantly be defined 
as a language of pitch accent in the strict sense of the term. 

5. Stress is determinate in Russian. The “accented syllable” in 
words that form a level or rising sequence of notes in the sentence rises 
invariably. But the syllable is not always stronger than others. 

6. In German, stress is pretty well fixed. As illustrated in section 4, 
Chapter III, with the exception of “Uhu” uttered with nearly level accent, 
the ratio of stress between two syllables in the dissyllables that twice occur 
in the dialogues, remains on the whole unchanged. This phenomenon may 
also be recognized in other languages of stress accent, but in German pitch 
also typifies the language. The characteristic features are more fully 
exhibited in the rise and fall of tone than in the degree of pitch. For ex- 
ample, in gehBrt, ge- has a falling tone, and -hdrt a rise-fall. In some 
syllables on the contrary such as tiber, the so-called “betonte Silbe” is 
weaker. To take a general view, therefore, the element of pitch may be 
said to figure in German accent. 

7. The order of stress among English syllables is quite immune from 
alteration, and the reverse appears to be the case with the ratio of stress. 
In the majority of cases, pitch and stress do not go in proportion. 

8. In Hindustani, all the dissyllables and polysyllables have a rising 
pitch and there is but little difference in stress between syllables. Stress 
is the more fixed. 

9. The final syllable of words uttered separately is high-pitched and 
bears strong stress in French. As has previously been pointed out, how- 
ever, both pitch and stress undergo modification by the tone of the 
sentence. 

We hear speech-sound not as individual phones or separate syllables but 
as a complete “Gestalt” from the standpoint of both tone and phbneme, 
which we shall term “auditory unit”. For example, although Hindustani 
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is a language of stress accent, the difference in stress between syllables is 
quite small — so small that if the words are uttered with the same pitfch, 
we very naturally fail to discriminate between them. As a result,, there 
is a loss of intelligibility. (Fig. 44-A). This loss is compensated for, 
however, by means of the rise in pitch. Each word then forms a “Ge- 
stalt” from the standpoint of tone and becomes one “auditory unit”; a 
great deal of intelligibility being obtained. (Fig.44-B). 

It can easily be imagined that 

^ accent as “Gestalt” is closely related 

to the “auditory unit” at the same 
time that it becomes fixed as sound 
® attribute, since it represents one of 

the tonal peculiarities of words. 
But in the case of Hindustani, 
the “auditory unit” has nothing directly to do with stress, the fixed 
element of accent, there being on the other hand, a close relation between 
itself and pitch, the non-flxed element. When both pitch and stress go to 
exhibit essential characteristic features of words as in German, each word 
is considered as an “auditory unit” by reason of its accent. 


Fig. 44. 


Now let us deal with other languages. First of all, a word becomes an 
“auditory unit” in Chinese by reason of its pitch accent, while a word or 
a collocation is the unit in Japanese and Mongolian. In Russian, too, 
a word is made a unit by stress accent and the rise in pitch on the “ac- 
cented syllable”. Again, in English, a word becomes a small unit by 
reason of its stress accent, and a combination of words forms a greater 
unit through the undulation of pitch. A phrase is made a unit in French 
chiefly by pitch. 

On the strength of what has been said, I am confident that we have now 
come to conceive clearer ideas of many features of the determinate and 
non-determinate elements of tone, and of their relation one to the other, 
as they occur in various languages Accordingly, it may be considered 
inappropriate to make any attempt at a definition of accent that con- 
siders the fixed element alone without paying the least attention to the 
non-fixed elements. This is especially so where pitch and stress go in 
proportion, as in Mongolian, with the result that both can be regarded 
as semi-determinate, as it were. 
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Research into the nature and scope of accent in the light of experimental 
phonetics has been made, though in a very sweeping manner, in the 
preceding chapters. Of this, however, the author is not prepared to say 
that no stone has been left unturned, and it has to be frankly admitted that 
there remain’ many things untouched. In fact, much depends on 
future studies in which I trust everything will be dealt with on a much 
larger scale: 

1. First of all, the study has to cover all types of accent in all languages 
of the world, i.e. research into the distribution of accent is necessary. In 
this present study of mine, experimental investigations have been made 
with regard to several of the languages. 

2. Secondly, we shall have to go into details as to accent as it occurs 
in each language. Here, types and rules of word- and sentence-accent will 
be classified, and an analysis will be made of the relation between 
sentence accent on the one hand and contents of speech and word 
accent on the other. To this point, however, reference has been made 
here and there in my present study. 

3. Thirdly, we have to know how the present system of distribution of 
accent has come into existence. 

The writer will consider himself amply repaid for his labours if his 
present study should contribute, even a little, to the elucidation of the 
many problems that have hitherto remained unsolved and also if it should, 
as giving practical guidance in studies in that direction, serve for the pur- 
poses of the international studies of languages made under the auspices of 
the International Linguistic Association. 

In conclusion, the writer has to express his sincere gratitude to those 
who have provided him with material by speaking at the microphone and 
to his colleagues for the good wiU shown and assistance extended to him 
during the course of the experimental work. He feels no leas grateful 
to Mr. Masato Kajiyama who has devotedly assisted him at every stage 
of the experiments and has arranged a large supply of materials to 
make this study possible and also to Mr. Koji Kosaka who has been en- 
gaged in the work of enumeration and the production of diagrams. 




APPENDIX 

Measurement of the Duration of Speech-Sound, 

; .Though the statistical measurement of the quantity of speech-sound 
seems to be of no fundamental importance to the problem of accent, 
it may serve to help towards the understand!^ of the distinctive tonic 
characteristics of the languages treated in the present volume. 

With this in view, the measurement of the length of sounds obtained 
from the oscillograms of English, German, French, Japanese, Chinese, 
Korean, Hindustani, Russian and Mongolian is given in a tabular form 
on the following pages : 

(Note : If we want to calculate only the number of syllables uttered 
within a certain time unit, we may use a watch, but for the clear dis- 
crimination of delicate differences between individual phonemes and 
for the exact measurement of their time, this method is not sufficient.) 

1. The heading on the upper left-hand corner in the subsequent 
tables indicates the numbering of those sentences treated in the third 
chapter of this volume. 

2. In the second column is given the duration of time (the time unit 
being one second) required for the utterance of the respective sen- 
tences. This entire length of time divided by the number of syllables 
in the sentence (enumerated in the third column) and the number of 
phonemes (the fourth column) give the average time for each syllable 
(the fifth column) and the average time for each phoneme (the sixth 
column) in each sentence. The figures in the eighth column indicate 
either the lengths of pauses or the lengths of voiceless consonants, and 
the figures in the seventh column indicate the duration of time needed 
for the utterance of vowels, semi-vowels as well as voiced consonants 
in each sentence. 

3. The enumeration of syllables is made in an ordinary manner, but 
in the case of the Japanese sentences, the unit is the Icam that is con- 
sonant-vowel as used in the Japanese syllabary (which is not the same 
as an alphabet). 
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4. Sound unit means an individual phoneme in the ordinary sense 
of the word. Thus, Japanese hana and German Obst consist pf 4 
phonemes respectively, while in the ease of English cannot, though 
phonetically two w’s fuse into one, each is counted separately. 

Again, like the Japanese verbal ending desu of which the finals 
usually becomes mute, a bisyllable word containing a voiceless element 
is counted as one syllable, and a diphthong consists of two phonemes. 


Chinese 
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0.0 

0 220 


io 3 

46 

12.2 

o.i 39 

0090 
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38 
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Note: 

A=sentence number. G=time required for voiced ele- 

B=entire time. ments. 

C=number of syllables. H=tirae required for voiceless 

D=number of phonemes. elements. 

E=average time for each syllable. I=average time for each voiced 
F=average time for each pho- syllable, 

neme. 

In Japanese, also, a long consonant or a long vowel is counted as 
two, but a long vowel in other languages makes one phoneme. We 
shall now examine in each language the degree of uniformity in aver- 
age syllable time, average phoneme time and average voiced-syllable 
time mentioned in the above table and those that follow. 

In English, both syllable time and phoneme time in the long sen- 
tence 1. are much longer than those in the short sentences 2. and 3. 
One reason for this may be that there is a very slight diiference 
between average voiced-syllable times m those three sentences. 

In German the time required for the utterance of each of the first 9 
sentences which contain a good many long vowels is much longer on 
the whole than that required for each of the latter conversational sen- 
tences. In the former, the average syllable time is 0.247 sec. the aver- 
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Japanese 
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Hindustani 
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French 
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Russian 



Mongolian 
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age phoneme time being only 0.096 sec., while in the latter they are 
0.198 sec. and 0.076 sec. respectively. 

In French, though 6 sentences either long or short were spoken by 
two men and one lady (see Chapter III, 6), the average syllable time is 
quite uniform. This fact evidently shows that both sentence accent 
and word accent have much in common in their qualities. 

In Japanese, with the exception of the last sentence, I eongraUdate 
you on your graduation ceremony, the average phoneme time and the 
voiced-syllable time agree with each other. (Note : it is quite na- 
tural that the sentence just referred to should differ from other sen- 
tences in its tonic style as it is a part of a congratulatory message given 
by the Minister of Education at the graduation ceremony.) 

In Chinese, phoneme time shows more uniformity than syllable time, 
whereas in Korean a greater uniformity is found in phoneme time with 
a single exception in the last 2 sentences Where are you going ? and 
I don’t know. 

In Hindustani both syllable time and phoneme time show an equal 
degree of uniformity, while in Russian there is a great deal of discre- 
pancy between them, and in Mongolian there is only a slight unifor- 
mity in phoneme time. 



A 

B 


D 

E 

F 

G 

English 

S.56 

44:56 

0.2J7 

Dec. 

0087 

ft 

7 M 

% 

226 

fmc, 

0.166 

German 

*59 

4 ‘'S 9 

0225 

0.087 

755 


0.169 

French 

340 

60.40 

0 227 

0 in 

7S 0 

22 0 

0.178 

Russian 

211 

83:17 

0.193 

0 090 

83* 

16 3 

0 165 

Korean 

^3 

57 '43 

0179 

0 074 

905 

9-5 

0.164 

Japanese 

r 76 

(76 24) 

0 130 

0073 

838 

16.2 

0.1 15 

Chinese 

263 

35 65 

0 239 

0 090 

962 

3 ® 

0231 

Hindustani 

235 

65:33 

0 187 

coSo 

834 

166 

0.157 

Mongolian 

2 30 

70-30 

o 2 j 7 

c 101 

935 

65 

0.220 


Note: A=number of phonemes divided by the number of syllables. 

B=proportion between syllables containing two phonemes and 
those containing three phonemes. 

C=mean average time for each syllable. 

D=mean average time for each phoneme. 
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E=:average time for voiced elements. " 

F=average time for all voiceless elements. 

G=mean average time for each voiced syllable. 

The table on the preceding page shows the average time for *each 
item in the respective languages. 

The first column shows the result obtained by dividing the number 
of phonemes which constitutes one syllable. 

In the second column is given the percentage of two or three 
phonemes. 

This percentage is the result obtained by calculating the numerals 
given in the first column. In English, for instance, the number of syl- 
lables consisting of two phonemes is 44% and that of those consisting 
of three phonemes is 56%. In Japanese all the syllables are confined 
to those consisting either of two phonemes or one phoneme. 

The average of syllable time measured in this manner differs con- 
siderably according to the sentences themselves, not to speak of the 
tone or style of speech or individual propensity of the speaker, but the 
average times given in the table on the preceding page may, on 
the whole, be regarded as the speed of speech-sound in a language. 

Then we shall see how this stands relatively in respective languages 
dealt with in this study ; — 

Syllable time is longest in Chinese showing 0.239 sec. (the character- 
istic feature of Chinese is to form a word by any one single syllable as 
already described in Chapter HI). Mongolian is just about the same as 
Chinese in this point. In French, German and English it is about 0.22 
sec., whereas in Russian, Hindustani and Korean it is about 0.19 sec. 
But in Japanese it is extremely short being only 0.13 sec. This is no 
doubt due to the peculiar structure of Japanese syllables. 

In the length of voiced-syllable time also, Chinese and Mongolian 
occupy the foremost position showing 0.231 sec. and 0.220 sec. respect- 
ively. French, German, English, Russian, Korean and Hindustani 
come next ranging between 0.178 — 0.157 sec., while in Japanese it is 
only a half of Chinese showing 0.115 sec. 

The proportion of syllables consisting of two phonemes and those of 
three phonemes is as follows: — 

In Russian it is 83%:17% showing by far the greater number of the 
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former. Mon^rolian comes next. In Hindustani, French and Korean 
the proportion is about 60%:40%, while in English and German k is 
about* 40% :60 % . In Chinese it is 35% :65 % , whereas in Japanese the 
proportion of syllables consisting of two phonemes and a single pho- 
neme is 76 % ;24% showing the nearest approach to Russian in respect 
to tihe number of syllables consisting of two phonemes. 

The time of one single phoneme, however, has not so much variety in 
length!, as that of one single syllable. According to my calculation, it 
is longest in French and Mongolian extending as far as 0.101 sec., but 
this decreases in the following order: — 

Chinese, Russian, English, German, Hindustani, Korean and Japan- 
ese. 


The End. 
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